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ABSTRACT 


The purpose of this study is to relate teacher leader 
behavior to pupil growth within specific theories of leader- 
ship and teacher effectiveness. The theory of leadership 
adopted in this study identifies two dimensions of leader- 
ship. One of these, Consideration, is defined as behavior 
oriented to the maintenance of personal security, friendli- 
ness, and satisfaction in group processes. The second, 
Initiating Structure, is defined as behavior establishing 
goals, institutional demands, and orderly procedures. 
Effectiveness is defined as the degree to which teacher 
behavior produces pupil achievement. Pupil growth is 
achievement consistent with an external goal. The achieve- 
ment accepted as growth in this study is measured by 
Alberta provincial examinations in Grade IX Social Studies 
and Mathematics. 

The study involved fifty-one teachers and the pupils 
in their classes. Measures of teacher leader behavior were 
obtained from ratings given by principals, teachers, and 
students using the Teacher Leader Behavior Description 
Questionnaire. Achievement and ability measures were 
obtained for each class and, by using a regression equation, 
the predicted achievement of each class was calculated. 


The final growth index is a z-score comparing predicted and 
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actual achievement in classes. 

The measures of leadership revealed significant 
agreement among the three groups of raters. Indices of 
consensus among judges gave coefficients significant at 
the .01 level. Within each judging group, the dimensions of 
leadership are significantly correlated indicating lack of 
independence between the dimensions of leadership. 

The relation of the leadership ratings to growth 
was calculated by using both parametric and non-parametric 
analysis. Taken singly in parametric analysis, both 
dimensions of leadership are significantly related to growth. 
Corrected multiple correlations, ek's, were calculated to 
relate both dimensions simultaneously to growth. For 
students and teachers, ck's of .41 and .40 were obtained. 
These coefficients are significant at the .05 level. 
Multiple correlation based on principals' ratings of 
leadership did not yield significant relationships. The 
non-parametric analysis generally revealed results similar 
to those obtained in parametric analysis, though in some 
cases the parametric analysis revealed relationships not 
appearing in the less powerful non-parametric analysis. 

It was concluded that teachers showing a high degree 
of leadership tend to induce high achievement in their 


pupils. 
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CHAPTER I 
INTRODUCTION AND STATEMENT OF THE PROBLEM 


The improvement of education depends greatly upon 
evaluation. Complex organization of personnel, programs, 
and pupils have been created to fulfil the need for 
schooling in society and it is a basic concern of adminis- 
tration to adapt these organizations to meet fundamental 
aims and purposes in education. Conscious adaptation 
implies evaluation of organizational systems, assessment of 
effectiveness, and change in the organization and its 
behavior. Part of professional concern in education is 
therefore the search for knowledge about systems of edu- 
cational organization in the hope that such knowledge will 
permit change and improvement of those systems. Thus as 
knowledge about the operation of educational systems 
increases, it becomes possible to initiate changes which 
increase the extent to which the fundamental aims and 
purposes of education can be achieved. The kind of know- 
ledge which permits rational, purposeful change is that 
knowledge which is precise, objective, and communicable. 
That is, it is the kind of knowledge which is derived from 
research and is characteristically scientific. It is the 
kind of knowledge which is the reverse of that which has 


mistakenly been called "art" and which has often been the 
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only tool for shaping administrative change. (9) 

Teachers form a large part of the educational setting 
and the evaluation of their effectiveness is therefore an 
important area in which reliable knowledge is required for 
the establishment or modification of administrative arrange- 
ments. The point is that teacher effectiveness is evaluated 
despite the lack of scientific knowledge about effectiveness 
and despite the failure of research to devise reliable 
methods for evaluating teachers. (1, 5, 14) The problem of 
teacher effectiveness is one which has attracted much 
research and yet the results have been largely disappointing. 
The results have not been adequate, for instance, to permit 
the construction of completely satisfactory teacher selection 
programs such as a director of personnel for a school system 
might use. However the basic problem remains. Adequate 
programs of teacher training, prediction of teacher success, 
supervision, and personnel selection, all wait upon valid 
methods of teacher evaluation. The McBeath study of teacher 
effectiveness is significant in that it links teacher 
behavior with effectiveness and works out an experimental 
design within a theoretical framework that contrasts sharply 
with previous haphazard approaches. (12, pp. 26-30) The 
McBeath study, while it produced findings pointing to 
interesting relationships between leadership and effective- 


ness, left these relationships ill-defined and ambiguous. 
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The study points to the need for an investigation in a setting 
where the criterion of effectiveness is sharply and objective- 
ly defined and where the criterion is clearly independent 


of the measure of leadership. (13) 


I. STATEMENT OF THE PROBLEM 


Teacher effectiveness may be defined as the degree 
to which the behavior of a teacher in a given setting 
produces pupil growth consistent with some external goal or 
purpose. Thus teacher effectiveness may be seen to have 
three elements--teacher behavior, pupil growth, and the 
external goal. The problem of this thesis is to relate the 
behavior of teachers as leaders to pupil growth as measured 
by examinations conducted in Grade IX by the Alberta Depart- 
ment of Education. In this case the goals implied by 
effectiveness are those appropriate to the Alberta curriculum 
and it is assumed further that the Grade IX departmental 
examinations are, in part at least, a measure of the degree 
to which those goals have been achieved. 

A limitation placed upon this study is that it does 
not attempt to measure any teacher variable except leadership. 
Leadership is a behavioral variable which will be opera- 
tionally defined for use in this study. Various teacher 
characteristics such as knowledge, training, or experience 


may be related to leadership as a behavior variable but this 
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study does not attempt to ascertain what they might be and 


does not seek to establish their relationship to leadership. 
II. THE SPECIFIC PROBLEM 


The central concern of this study is to study the 
relationship of teacher leader behavior to pupil growth when 
three separate ratings of teacher leadership are obtained 
and when each rating is related to a single criterion of 


pupil growth. 


Sub-problems. 


(1) To describe the leader behavior of teachers when 
ratings of leadership are obtained from principals, other 
teachers, and students. 

(2) To determine an index of growth for a class taught 
by each teacher rated for leadership. 

(3) To study the consistency among the three separate 
ratings of the leader behavior of teachers. 

(4) To study the consistency with which these three 
separate ratings of teacher leader behavior are related to 


the single criterion of pupil growth. 
III, DEFINITION OF TERMS 


Initiating Structure-in-Interaction. Initiating 


Structure-in-Interaction is also referred to as Initiating 
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Structure and is one of the two dimensions of leadership 
measured by the Teacher Leader Behavior Description Question- 
naire (TLBDQ). Initiating Structure is that aspect of 
leadership which deals with establishing clearly defined 
organization, expectations, procedures, and meansof communi- 


cation. 


Consideration. Consideration is the second dimension 
measured by the TLBDQ. It is that aspect of leadership 
associated with warmth of feeling, pleasure in group processes, 


friendliness, and respect. 


S-Score. An S-Score is the score derived from the 
TLBDQ@ which represents the degree to which a teacher mani- 


fests Initiating Structure. 


C-Score. A C-Score is the score derived from the 
TLBDQ which represents the degree to which a teacher mani- 


fests Consideration. 


Leadership style. Leader behavior is described on a 
graph whose X and Y axes represent the S-scores and C-scores 
of teachers. This graph is then divided into four quadrants. 
Leadership style refers to the quadrants as they represent 
the two dimensions of leadership. Quadrant I represents 
teachers who are high in both the dimensions and Quadrant III 


represents teachers low in both dimensions. Quadrant II 
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represents teachers having high C-scores and low S-scores. 
Quadrant IV represents teachers having low C-scores and high 


S-scores, The leadership styles are illustrated in Figure l. 


Judges. The judges are those groups using the TLBDQ 
to rate the leader behavior of teachers. The judging groups 


are principals, other teachers, and students. 


Teachers. This group is made up of teachers of 
Grade IX social studies and mathematics for whom leadership 
ratings were obtained. They are also designated as the 
"participating teachers" or the "experimental group of 
teachers" to distinguish them from the "other teachers" in 


the school. 


IV. DESIGN OF THE STUDY 


The design of the study involved fifty-one teachers 
of Grade IX and the pupils in their classes. The leader 
behavior of these teachers was described by three groups of 
persons. The principals of the schools in which the teachers 
were located comprised the first group. The principal 
completed one copy of the TLBDQ for each participating 
teacher in the school. Groups of five other teachers in the 
school completed TLBDQ's on each of the participating 
teachers and the scores derived from these represented the 


leader behavior ratings of the second set of judges. Ten 
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FIGURE 1. 


QUADRANTS OF LEADERSHIP STYLE ON A GRAPH OF THE C-SCORES 
AND S-SCORES OF PARTICIPATING TEACHERS 


pupils, randomly selected from the participating teachers' 
classes, contributed a third set of ratings of the teacher 
leader behavior. 

Bach of the fifty-one teachers was selected by virtue 
of the fact that he was either a teacher of Social Studies 
or Mathematics in a school willing to participate in the 
experiment. The Grade IX examination results of the classes 
taught by the participating teachers were analyzed to derive 
growth indices. The growth indices of the classes were then 
correlated with the S- and C-scores of the participating 
teachers. Chi-square tests were also used to relate growth 
to the dimensions of leadership. Leadership style was 
related to growth by using multiple correlation and by using 
Fisher's exact probability. The agreement among judges 
about leader behavior was analyzed by correlating between 
group ratings of the dimensions of leadership. Kendall's 
coefficient of concordance was used to determine an over-all 


agreement between judges about leader behavior. 
V. INSTRUMENTATION 


The instrumentation of the study depended upon the 
use of the TLBDQ for measures of leadership and upon the 
Departmental examinations in Mathematics and Social Studies 


for the growth indices, 
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TLBDQ. Ratings of teacher leader behavior were 
obtained by using the instrument developed by McBeath from 
the Ohio studies of leadership. (See Appendix A) The Ohio 
studies represent a body of extensive research in leadership. 
These studies produced a deceptively simple instrument, the 
Leader Behavior Description Questionnaire (LBDQ), which was 
used to relate the behavior of various kinds of leaders to 
measures of effectiveness. (15) McBeath, in his study of 
teacher leader behavior, adapted the LBDQ for use in school 
settings and called the new instrument the Teacher Leader 
Behavior Description Questionnaire (TLBDQ). The TLBDQ is 
largely a replication of the LBDQ with two exceptions. The 
number of negatively scored questions was reduced from three 
to one and two items in the Consideration dimension were 
altered to include "behaviors . . . considered important by 
students." (12, p. 39) It is apparent from the study that 
McBeath considered the TLBDQ a successful instrument though 
the question of the reliability of the leadership dimensions 
is raised by some of the item analysis results. (12, p. 39; 
pe 217) 

In his manual for the LBDQ, Halpin describes the 
essential features of the instrument and this description 
applies equally well to McBeath's TLBDQ. 

The Leader Behavior Description Questionnaire (LBDQ) 


provides a technique whereby group members may describe 
the leader behavior of designated leaders in formal 
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organizations. The LBDQ contains items, each of which 
describes a specific way in which a leader may behave. 
The respondent indicates the frequency with which he 
perceives the leader to engage in each type of behavior 
marking one of five advers: always, often, occasional- 
ly, seldom, never. These responses are obtained from 
the members of the leader's immediate work-group, and 
are scored on two dimensions of leader behavior. For 
each dimension, the scores from the several group members 
are then averaged to yield an index of the leader's 
behavior in respect to that dimension. (11, p. 1) 


The response to each item is scored on a scale from 4 to O 
corresponding to the responses, always, often, occasionally, 


seldom, never. For the negative item, the scoring scale is 


reversed from 0 to 4 for the same responses. 


Growth measures. The measures of pupil growth were 
derived from the Alberta Department of Education examina- 
tions which are given to all students in the province at 
the end of Grade IX. The leader behavior of the teachers in 
this study was related to their teaching in Social Studies 
or Mathematics classes. The results of the pupils in these 
elasses were collected from the Department of Education. 

At the same time examinations in school subjects are being 
conducted, all students in the province also write the 
Cooperative School and College Ability Tests (SCAT). Thus 
for all pupils in the experimental classes, two measures 

were obtained=--one a measure of academic ability and the 
other a measure of academic achievement. The SCAT measure 

is identified as the independent variable and the achievement 


measure in either Mathematics or Social Studies as the 
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dependent variable. In the following sections of this report 
the SCAT measure will be referred to as the X variable and 
the subject matter measure as the Y variable. 

Using these two variables, the correlation between 
pupil ability and achievement may be determined and using 
the distributions of the X and Y variables, a regression 
equation may be derived to calculate a predicted Y score. 

In the following sections of this report, this predicted Y 
score will be referred to as Y'. It should be noted that the 
X and Y will be used not to refer to individual pupil 
achievement but to the average ability and achievement of an 
entire class. When individual pupil scores are referred to, 
the terms pupil X and pupil Y will be used. 

To derive an index of growth, it is only necessary to 
compare predicted class achievement with the actual class 
achievement (Y - Y'). To make these differences comparable 
between classes, it is necessary to change them to z-scores 
and this may be done by dividing these differences by the 
standard error of the predicted Y'. Thus the growth index 
of a class is defined as the difference of predicted achieve- 
ment from actual achievement divided by the standard error 
of the predicted achievement: 

G=Y- Yt 
One! 
This technique is based upon a discussion of the standard 


error of the predicted Y' which may be found in Garrett, 
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12 
Guilford, and Ferguson. (6, pp. 223-4; 8, pp. 160-2; 10, 
pp. 372-3) 

SCAT. Since the Cooperative School and College Tests 
(SCAT) play such a large part in the definition of pupil 
growth used in this study, they may be regarded as an 
instrument of the study. The role of SCAT in the growth 
index is to predict academic achievement and it is precisely 
for this purpose--prediction--that the SCAT was devised. 
Cronbach describes the test as lying mid-way between tests 
with a maximum educational loading and those with a minimum 
educational loading. (3, pp. 234-37) Thus the SCAT falls 
between tests that are measures of subject-matter proficiency 
and tests that are designed primarily to measure more pure 
ability such as Spearman's g. The SCAT manual emphasizes its 
role as a predictor of academic success. 

The tests in the SCAT series have been designed and 
developed for the principal purpose of helping teachers 
and counselors--and students themselves--to estimate 
the capacity of each individual student to undertake 
the academic work of the next higher level of schooling. 
eo ire CAE Piet) 

Reliance upon SCAT is further encouraged by the 
technical competence with which the instrument has been 
assembled. It is the product of careful test construction 
and recent reviews of the instrument are cautiously opti- 
mistic about it. 

Undoubtedly, SCAT is a superior test series. It 


clearly shows the result of careful planning, an 
excellent experimental programme, and the use of sound, 
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13 
up-to-date statistical procedures. (2, p. 455) 
In studies of concurrent validity, correlations ranging from 
-50 to .70 were obtained between the various scales of the 
test and school grades. (2, p. 451) 

The SCAT tests are composed of four sub-tests from 
which three scores are derived--a verbal, a quantitative, 
and a total score. The meaning of these scores are perhaps 
the weakest point in the tests. 

There are a number of .. . deficiencies in the SCAT. 

In the first place, a number of doubts are raised by the 
quantitative score: in the upper grades the items are 
apparently too easy, at least for superior students... . 
The validity study shows disturbingly high intercorre- 
lations between the quantitative and verbal score. ... 
One wonders whether the quantitative subtests are 
indispensible and whether their omission would greatly 
reduce the effectiveness of the test. (2, p. 455) 

In calculating the growth index of the experimental classes 

in this study, only the total SCAT score of the pupils was 

used. The correlations of these total scores with Social 

Studies and Mathematics achievement were revealed to be 

-77 and .78 respectively. The number of students involved 

was 1,462 and this number with the high validity correlation 


was felt to provide a good reason for using the SCAT total 


score as a predictor of student achievement. 
VI. HYPOTHESES 


The research hypotheses of the study may be stated as 


follows: 
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(1) Teacher leader behavior is positively related to 
pupil growth. 

(2) There is significant agreement among the judgements 
of principals, other teachers, and students about teacher 
leader behavior. 

It will be noted that the first hypothesis implies a 
one-tailed analysis in determining levels of significance. 
Stated as null hypotheses, the working hypotheses of this 
study are as follows: 

(1) There is no significant agreement among the three 
groups of judges in their descriptions of teacher leader 
behavior. 

(2) The behavior of teachers as Initiating Structure is 
not Boba igen related to pupil growth. 

(3) The behavior of teachers as Consideration is not 
positively related to pupil growth. 

(4) No judging group's description of leadership is more 
highly related to pupil growth than is that of any other 
group. 

(5) No leadership style of teachers is more highly 


related to pupil growth than is any other leadership style. 


VII. PROCEDURES FOR TESTING THE HYPOTHESES 


Agreement among judges. The three groups of judges 
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describing the leader behavior of teachers are principals, 
other teachers, and students. In the case of principals, 
there is only one TLBDQ completed for each teacher since 
there is only one principal in each school. Thus the 
principal's assessment of the leadership dimensions becomes 
the S- and C-score of the teacher directly. However, in 
the case of other teachers and students, more than one 
person is asked to complete the questionnaire on the teacher. 
Thus for every participating teacher there are ten descrip- 
tions of leader behavior by students in the teacher's class 
and five descriptions by other teachers in the school. To 
obtain the students description of the dimensions, the ten 
S-scores and C-scores on all ten papers are totaled and 
averaged to obtain a single student description of teacher 
leader behavior. A similar procedure is used with the five 
TLBDQ's completed by other teachers in the school to obtain 
the other teachers description of teacher leader behavior. 

To ascertain the agreement between the three sets of 
judges each group's S-scores are correlated with those of 
the other two groups. Similar correlations are made among 
C-scores to ascertain agreement on this dimension. Within 
each group, the S-score may be correlated with the C-score 
to determine the extent to which each is independent of the 
other. 


In establishing the agreement among the various judges, 
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correlations will be computed using both Pearson's r and 
Spearman's rg. Similarly these two measures are used to 
establish the independence of the two dimensions. Kendall's 
coefficient of concordance is used to establish the overall 
agreement between the three sets of judges on each of the 


dimensions of leadership. 


The relationship of leadership to growth. In relating 


the dimensions of leadership to growth, correlations are 
calculated between growth measures and teachers! C-scores 
and S-scores separately. Pearson's r is used for these 
correlations. These correlations are calculated for each of 
the three groups of judges. 

The relationship of leadership to growth is also 
sought through the use of a non-parametric analysis. The 
growth indices are divided into three equal sections 
representing high, average, and low growth. Teachers having 
high and low growth indices are compared on the dimen- 
sions of leadership. Each of the two dimensions of leader- 
ship are divided into high and low groups by dividing the 
range on each dimension at the median. Thus a four fold 
table is derived for thirty-four teachers comparing them for 
high and low growth and high and low Consideration or 
Structure. The significance of the observed distribution in 
this table is determined by a chi-square analysis using a 


goodness of fit test with equal probability determining 
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expected values. 

Two procedures are followed to test the relation of 
the quadrants of leadership style to growth. First a 
multiple correlation is calculated between the two dimensions 
of leadership and growth. Secondly, the quadrants of 
leadership are compared using Fisher's exact probability. 
The dichotomies represented in this test are the teachers 
in any two of the quadrants and their growth indices split 


into high and low groups as described above. 
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CHAPTER II 
RELATED LITERATURE 


The evaluation of teachers is an old and well- 
established practice. On an informal level, pupils, adminis- 
trators, parents, and the public are accustomed to thinking 
in terms of "good teachers" and "poor teachers". Yet, while 
no one would wish to dispute the truism that some teachers 
are more effective than others, immense difficulties have 
been encountered in attempting to reduce the "goodness" 
which the teacher manifests into an objective measure. 
Despite such difficulties, the evaluation of teachers pro- 
ceeds as if the ratings applied had some objective and 
rational basis. In some cases, the evaluation employs only 
very crude, ad hoc methods, while in others, elaborate and 
expensive techniques have been employed. (3) The salary 
scales under which teachers are employed operate on the 
assumption that years of training and experience are valid 
indicators of teacher merit. Thus, for the personnel officers 
of school systems, for supervisors of instruction, and for 
teacher training institutions, the development of precise 
measures of teacher effectiveness and reliable predictive 
techniques are essential to the accomplishment of many 
important aims. If the good teacher and the poor teacher 


ean be identified objectively, problems in hiring teachers, 
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in training of teachers, and in improvement of instruction 


are immensely simplified. 


I. BASIC PROBLEMS IN EVALUATION OF TEACHERS 


The basic problem in the evaluation of teachers may 
be seen to lie in a small but critical number of issues. 
The first problem is to identify the factors which are 
causative or predictive of teaching success. Secondly, 
criteria must be established by which the success of the 
teacher can be judged. Finally, there is the need for a 
general theory or framework within which research may pro- 
ceed, Pervading all of these problems is a general one 
connected with the construction of satisfactory instruments 
of measurement and the development of techniques for their 


application. 


Teacher variables and earlier research. The 
characteristic which distinguishes almost all research into 
teacher effectiveness is the search for causative variables 
regarded as lying entirely within the teacher. That is, the 
"soodness" of a teacher was seen as an inherent quality like 
the inherent qualities which make apples red or water wet. 
To the extent that these hypothesized causative factors were 
present, the individual would make a good or successful 


teacher. Evans (7) points out that this procedure confuses 


the two separate issues of teaching ability and teaching 
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22 
competence. Teaching ability assumes that there is some 
quality of persons which, if we could measure it, would be 
an indicator of success in teaching whether that person had 
ever been in control of a classroom or not. Teaching 
ability, therefore, refers to capacity to teach. Unfortunate- 
ly, it is precisely such a capacity which has proved so 
elusive and difficult to measure. Teaching competence 
refers only to the performance of an individual under given 
conditions and in relation to certain criteria. Remmers (19) 
states that the search for single or multiple traits of 
teachers as indicators of success has lead to an over- 
generalized and oversimplified view of the complex process 
of teaching. 

The concept of teacher qualities as the ultimate 
source of teaching success has been so compelling a one that 
researchers have examined many kinds of teacher traits and 
much research has been directed towards isolating the relevant 
teacher variable or variables that are indicative of success. 
Ackerman (1) lists eight such variables which have been the 
subject of extensive research: 

- Age of the teacher 

- Attitude of the teacher to teachers and teaching 
. Experience of the teacher 

- Intelligence of the teacher 

. Professional information of the teacher 

- Personality of the teacher 

- Social attitudes of the teacher 


Training of the teacher 


In addition to these, Evans (7) adds physical characteristics, 
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23 
interests, and scholarship of the teacher. 

Many of these research projects have been well designed, 
carefully controlled, and subject to extensive checking through 
corroborative studies. The results, despite highly sophisti- 
cated techniques of correlation and factor analysis, have 
been largely disappointing. During the past thirty years, 
the problems of teacher evaluation have been subjected to 
hundreds of research projects and yet no coherent or con- 
sistent evidence can be discerned as a result. The per- 
sistence with which research workers have sought causative 
factors within the teacher himself is readily understandable, 
since no one is apparently ready to state that variables 
within the teacher can be unrelated to the success which 
he will have in the classroom. However, the obvious lack of 
success of this kind of research must argue that however 
important teacher variables are, they are not directly or 
solely responsible for the success of the teacher. 

The unreliability and inconsistency of research into 
teacher variables has been well illustrated by Barr (3) in 
his commentary upon a series of extensive and well-designed 
experiments carried on under his direction at the University 
of Wisconsin. 

The correlations of the Washburne Social Adjustment 

Inventory, for example, vary from .06 to .47 depending 
on the criterion and the setting from which the data 
were derived. The correlations for the American 


Psychological Examination ran from -.12 to .57. Rolfe 
. . . reported... a correlation of -.10 for the 
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American Council Psychological Examination . .. which 

is considerably lower than that reported by Roster (.57) 

andehabuke. (254)... (&; p. 202.) 
In these studies being commented upon by Barr, one is 
impressed first by the frequent occurrence of very low 
correlations and by the lack of agreement between the various 
researchers. 

In explanation of these difficulties, Barr cites 

"the well-known fact that coefficients of correlation cannot 
be taken at face value." (3, p. 202) If this is true in 

the case of the coefficients he reports, one might wonder 
how the research itself can be taken at face value. However 
Barr goes on to explain his difficulties: 

- - e Individual items may not discriminate between 
good and poor teachers. The conventional item analysis 
technique thus seems not to work here; at least the 
discriminating power of individual test items seem to 
be lost in errors of measurement or something. (3, 
pp. 202-3) 

Barr leaves the reader to conjecture what that something 
might be. However, in a later review of the findings of 
research, Barr (4) eriticizes research which proceeds as 
though "the qualities in question resided entirely in the 
teacher." He admits further that "no one appears to have 
developed a satisfactory working plan or system that can be 
used by personnel officers who make judgements about teacher 
effectiveness." In modification of the earlier teacher- 


variables approach, he suggests that effectiveness is a 


quality not of the teacher alone but of the relationship 
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between teachers and pupils. 


The problem of criteria. Part of the difficulty in 
arriving at consistent results in teacher evaluation has 
been the semantic difficulty occurring when a criterion 
meaning "good" is employed to identify effectiveness. The 
source of this problem and a method of avoiding it is well 
stated by Levin. (15) He points out that researchers have 
often combined the function of collecting facts and that of 
making value judgements about them. These functions, if the 
research is to be valid, must be separate. Thus the 
researcher may properly investigate the relation between two 
measured variables. If at the same time he asks that the 
teacher be rated in terms of "goodness," a value judgement 
extrinsic to the research process is introduced. When a 
teacher is rated by a principal or superintendent as to 
"goodness," each rating will represent answers from different 
frameworks of values. Thus to use "goodness" as a criterion 
of effectiveness is to compile as many kinds of effective- 
ness as there are notions about what effectiveness ought to 
be. By using a criterion of "goodness," both the criterion 
and the effectiveness measured by it are really within the 
same dimension. In other words, what is effective is defined 
as what the rater believes is effective. 

The problem of what goal ought to be the object of 


teacher effectiveness is not answerable through research. 
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Research can measure how effectively certain goals have been 
attained. Teacher A may be effective in achieving Variable 
K, while Teacher B is effective in achieving Variable Y. 
Thus, Teacher A exceeds Teacher B in the teaching of number 
facts, but Teacher B exceeds Teacher A in the teaching of 
expression through written language. The "goodness" of these 
teachers can only be assessed in terms of a second and 
distinctly separate value criterion. Moreover, Ackerman 
(1, pp. 284-5) states that the researcher cannot even compare 
the degree of effectiveness of these two teachers since 
they have been measured on two different scales. Levin 
concludes, however, that the "good" teacher can be defined 
in terms of the findings of research: 

The ultimate description of the "good" teacher 
involves a choice which ideally uses research results, 
but cannot be exclusively determined by them. "Science 
can tell us how to get there but not where to go." 

(13 Be 99) 

It is clear, therefore, that notions of "good" and 
the ultimate criteria of teacher evaluation are bound up 
with notions about the aims or goals of education. This 
realization is stated in much of the current writing on 
teacher evaluation research. (1, 6, 7, 8, 12, 15, 19, 22) 
Moreover, within the broad goals of a total educational 
system, there may be different kinds of effectiveness 
required to accomplish the over-all goal. For instance, 


both verbal facility and mastery of facts may be required of 
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pupils. Thus the effectiveness required of the teacher of 
English may be different from that required of the teacher 
of Mathematics. Indeed, it is only in terms of a specified 
goal that the term "effectiveness" can have any precise 
meaning for the evaluation of teachers. In other words a 
teacher will be regarded as effective in the degree to which 
he accomplishes goals that are held to be worthy of 
accomplishment. As Gage and Orleans state (8, p. 296), 
there are really two problem areas for teacher evaluation-- 
the determination of social values and the assessment of 


efficiency in teaching them. 


Gage and Orleans (8, pp. 296-98) also make a useful dis- 
tinction in regard to criteria. Though the ultimate criteria 
of effectiveness must be held to be the values held by 
society, the researcher must work with smaller, clearer, 

and more manageable criteria. These are secondary criteria. 
In their most convenient form, they are held to be the 
effects produced by teachers on their pupils. The facili- 
tation and direction of pupil growth is presumably why 
society organizes schools and the use of pupil growth as an 
intermediate criterion seems amply justified. However, 
further questions are raised here when we consider what will 
be measured as growth and how this measurement can be done. 


Biber accepts the use of pupil growth as the criterion of 
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teacher effectiveness but points up the accompanying problems: 
Granted that this effect on pupils is the ultimate 
measure of significance, it is nevertheless, true that, 
when goals are conceived on a complex level, involving 
thought-processes as well as fact assimilation and 
adaptive behavior, it is not possible to short-cut 
intermediary factors and seek for direct measures of 
teacher-effectiveness in terms of effects on pupils. 
CGe pee 215) 
It is not necessary to accept Biber's reservations completely 
to recognize that the definitions of pupil growth and the 
instruments used to measure it are of great importance when 
growth and effect are the criterion of teacher effectiveness. 
The strength in the use of pupil growth as a criterion 
remains the fact that it is possible to relate effectiveness 
of the teacher to an objective standard which is in a separate 
dimension from the quality being measured. Thus the effects 
of certain qualities of teachers and their methods of pro- 
cedure in the classroom no longer need be judged by sub- 
jective ratings of the effectiveness. With the possibility 
of quantification of measures of effectiveness, greater 


reliability and objectivity are introduced into teacher 


evaluation research. (18) 


Problems in the theory of teacher evaluation. Several 
studies have pointed to the need for a framework or a 
theoretical model within which teacher evaluation research 
may be earried on. (9, 14, 15, 19, 21) In any field of 


research, theory is necessary to gain significant results. 
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29 
Ryans enumerates four benefits from the judicious use of 
theory: (21, pp. 463-4) 

1, Theory is a guide to the formulation of hypotheses 
and trains investigators to look for facts not ordinarily 
apparent. 

2. Theory permits research to be cumulative. Newer 
studies confirm or reject older ones and older ones supply 
data and direction for newer ones. 

3. Theory indicates what studies are crucial and leads 
to the selection of fruitful hypothesis from an infinite 
number of possible ones. 

4. Theory permits research to proceed systematically and 
allows conclusions to be easily expressed and communicated. 

Ryans and others have commented on the fact that past 
research on teacher effectiveness has been characterized by 
a singular lack of theory. As a result, the findings of the 
massive number of studies which have been completed have 
lacked orientation and co-ordination. Variables have 
apparently been selected more for their convenience and 
accessibility than for their possible significance. With a 
framework of theory, meaningful variables are more likely 


to consume the researchers! efforts. 
II. TEACHER BEHAVIOR AND TEACHER EFFECTIVENESS 


Since the failure of research into character traits of 
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30 
teachers to produce significant results, it has been apparent 
that a new approach to the dynamic of teaching is necessary. 
The old approach had oversimplified and short-circuited the 
complex process by which the teacher teaches and produces 
desired growth in his pupils. Recent studies have found, 
in notions about teacher behavior, a concept broad enough to 
describe the process by which learning takes place within 


the school and the classroom. 


Teacher behavior as a variable in evaluation. One of 
the problems connected with evaluation of teacher effective- 
ness has been the difficulty of separating value judgements 
from a pure description of the situation. In setting down 
teacher behavior, it is possible to describe conditions 
within the classroom without having to rate their effects 

as being inherently good or bad. Ryans (21, p. 467) defines 
teacher behavior as "the behavior, or activities, of persons 
as they go about doing whatever is required of teachers, 
particularly those activities that are concerned with the 
guidance or direction of the learning of others." Implied 

in this definition is a relationship between teacher behavior 
and pupil behavior as a reciprocal and ever-changing inter- 
action. Thus teacher characteristics or pupil character- 
istics alone cannot explain the effectiveness of the teacher 
but rather both of these things as they are manifested 


through behavior in the classroom account for the process of 
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31 
learning which actually takes place. 

In a description of behavior, values may be intro- 
duced as a set of objectives for learning or as a set of 
norms to guide the methods used to bring about the learning, 
but these objectives and norms do not need to be made a part 
of the description of the learning process as it is observed 
to operate. In describing behavior and using it to discover 
the dynamic of learning, the researcher will necessarily 
have to accept as a reference point the goals set by the 
system within which specific teachers are working. However, 
it is still possible, as Gage and Orleans point out, that 
"within a given situation, teacher effectiveness in realizing 
specific ends is to be conceived of as embracing a variety 
of equally successful patterns." (&, p. 297) That is, there 
is no need to think of teacher behavior as a unitary thing 
or as a "common list of behavioral factors which are 
necessarily present in all effective teachers." 

Teacher behavior will, however, lend greater clarity 
to notions about teacher effectiveness. Effectiveness has 
been defined as the degree of change or growth which the 
teacher can induce in pupils. Although previous research 
sometimes used this definition it sought to relate it to 
Static characteristics within the teacher. Like growth 
itself, which must be a dynamic concept, teacher character- 


istics must be thought of as acting within a given situation 
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32 
to produce certain effects. It is not the characteristic 
itself which produces the effects but the manner in which 
it is employed. In other words, it is a certain kind or 
pattern of behavior which produces the change and growth and 
it is to behavior that we must have reference when teacher 
effectiveness is measured. 

Levin reduces teacher behavior and effectiveness to 
a three-stage process or equation. (15, pp. 101-102) The 
central area in this process involves teacher behavior within 
the classroom or school. However, prior to this behavior 
there are causative factors and forces. In the case of the 
teacher, the factors may likely be found among moods, atti- 
tudes, intelligence, personality needs, training, knowledge, 
and experiences. As a result of the behavior, certain 
consequences in terms of influences upon pupils are evident. 
These consequences can then be measured in terms of pupil 
concepts, attitudes, and abilities. The most comprehensive 
research should deal in terms of all three of these 
dimensions--the causes of behavior, the behavior itself, and 
the effects of behavior on pupils. Useful research might 
deal with the relation between the behavior and any of the 
other two dimensions. Earlier research failed because it 
attempted to relate causes and effects without taking into 
consideration the intervening classroom or behavior variable. 


For example, the factor of teacher intelligence is clearly a 
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33 
causative factor in teacher behavior. Yet after many studies, 
it has proved impossible to delineate clearly the relation- 
ship between teacher intelligence and pupil change. (7, po. 26) 
This failure is likely due to the fact that the intervening 
step involving the behavior of the intelligent teacher within 
the classroom has been omitted. It is, therefore, hardly 
surprising that two factors, separated as these are from 


each other, produce low or inconsistent correlation with each 


other. 


Teacher behavior as a variable in research. The 
complexity of problems in teacher effectiveness have, in one 
sense, not been simplified by the introduction of teacher 
behavior as a variable. Descriptions of teacher behavior 
imply knowledge of complex factors within the situation to 
which the behavior is related. If research is to be effective 
in this field, it must find a theoretical framework which 
accommodates the basic complexity of the situation. Ryans 
(21, p. 468) suggests certain formulae which convey the 
complexity and the scope of the task which is being assayed 
in using behavior as a variable. These formulae describe 
teacher behavior as a dynamic combination of such factors 

as the situation in which the behavior occurs and the 
motivation, experience, and genetic make-up of the organism. 
The representation of such factors as indices in a formula 


present difficult problems indeed. Goethals comments that 
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the complexity of research needed in dealing with the 
multiple variables of the behavior approach is justified in 
that the psychological and sociological implications of the 


problem are themselves highly complex. 


The need for independent criteria. The basic objection 
that has been lodged against much previous research in | 
teacher competence is that both the concepts and orarersa 
in which they dealt lacked clarity and objectivity. The 
criteria were frequently the rating of principals, super- 
visors, or other observers. Coneepts of effectiveness were 
limited by the criterion used and by the instruments of 
measurement. Levin (15, pp. 100-101) suggests, therefore, 
that researchers are forced to define in terms of the instru- 
ments used and in terms of the criteria employed. Research 
into teacher competence must, thus, give up notions about 
definitions in absolute terms and adopt operational defin- 
itions that have meaning in terms of the procedures followed 
and in terms of the measurements taken. He suggests further 
that, until one procedure is widely adopted and recognized 
as generally applicable, we should use the device of semantic 
studies and write teacher competence with subscripts to 
denote the variety and the means by which it was measured. 

Operational definitions are well suited to studies 
involving teacher behavior, since behavior makes no state- 


ments about inherent qualities of individuals. Thus Ackerman 
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2D 
(1, p. 286) states that traits and characteristics are of 
importance only as their effects can be “anchored to some 
sort of denotable behavior." In this way, much clearing of 
ground and establishment of useful concepts will be ac- 
complished and the whole field of research will be able to 
proceed more fruitfully than in the past. 

As a specific example of the methods of procedure and 
the application of new approaches to teacher competence 
research, Levin (15, p. 104) adduces hypotheses concerning 
teacher behavior that have been tested at Harvard. These 
hypotheses are based upon extant theory in personality and 
learning but are tested in the behavioral dimension. One of 
the most interesting hypotheses may be paraphrased as follows: 

The higher the dominance needs of a teacher, the more 

she will control the class by use of recitation 
procedures, and the more the use of recitation, the more 
the pupils will retain specific facts and the less they 
will generalize the facts. 

Through testing such hypotheses as this, much new and 


valuable light will be thrown on the problem of teacher 


competence through use of the behavioral approach. 


Summary. In summary of the problem of teacher 
competence, Remmers and his associates on the Committee on 
Criteria of Teacher Effectiveness have set forth clear 
statements of principle which point both to past errors and 
future courses of development. (19) 


1. The study of teecher effectiveness must assume the 
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possibility of different patterns of effectiveness for 
different kinds of teachers, pupils, and programs. There 
may also be a variety of patterns within any given teacher- 
pupil situation. Effectiveness must, therefore, be regarded 
as multi-dimensional and not as being limited to a unitary 
concept. 

2. The ultimate criteria of teacher effectiveness are 
accepted as lying in changes within pupil behavior, changes 
in the operation of the school, or changes in the community 
in its relation to the school. 

3. The problem of predicting teacher effectiveness is 
one of predicting that a teacher will produce certain changes 
in pupil behavior, or that a candidate for a teacher education 
program will when he is a teacher. 

4. To predict teacher effectiveness, it is necessary to 
relate teacher characteristics through teacher behavior to 
the effects produced by them. 

5. Research must include situational variables within 
the dimensions of teacher behavior. Teacher behavior is 
related to characteristics of the pupils, to the nature of 
the school program, and to a host of other situational 
factors. The hypotheses that researchers choose must, there- 
fore, become more "subtle, conservative, and realistic in 
relation to the factors that actually govern relationships 


between the behaviors and the effects of teachers." (19, p.642) 
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6. Psychological theory and theory from other disciplines 
should serve as bases for setting up of research frameworks 
and productive hypotheses. 

7. Research dealing with teacher behaviors and character- 
istics, with the effects of teacher behavior, and with the 
numerous intervening variables of the situation will have to 
face large and difficult problems of measurement. Valid 
instruments are presently not available to measure many of 
the pertinent variables and part of the task must involve the 
construction of such instruments. 

8. Research cannot be of the over-all "ideal" type 
designed to answer all questions at once but must be specific 


and related to many categories of the general problem. 
III. LEADERSHIP 


Sociologists are concerned primarily with the 
functioning of groups and one of the most interesting aspects 
of group behavior is the phenomenon of leadership. (13) 
Schools and teachers are necessarily involved with groups 
and leadership in groups in striving to accomplish the goals 
of education. Thus, measurements of teacher competence 
through a behavior variable will be involved with problems 


centered around group actions and leadership functions. 


Aspects of leadership. Leadership has long been 


regarded as an inherent characteristic or trait of certain 


ve 
esnilgioeth xedvo moxvt qrosdé bre yrosdy Tig 


avtowsm@ert darseder to oe “ot eeaads: 


=resostsdo bec etoiveded ssdoset nobw ooitbien doneoees fe 
ont Aciw das ,rekvered retiosed to esate ovkubaioesn . 
og sven [itw noivsudie sft To acids itav Sebdevresad euowemtter 
bilaV. .gnomenusesm to emoldotq PSEMIED bas egtel oost 
to yosm stvesom o¢ oldsitevs ton visnseesg eTe edneauxvent 
exit a taum er sit to cisqd bas 2eldeiisy smontéreq: sia 
Legnediidedh dowe te noivoutsanes 
eqyt "Lsebi" Ife-rsve si# Yo od sommes Aaah +° : 
oittosge ed tem: dod sono ts enottessm Lis sewens of bengteeb , 
idea Levessy eng to zelrogedso yirsm ot bedefew buns u 
qIestsonse TEI : : | 
edt d¢tw ylitsmisq bemisone9) Sts etetgolotso® ~/*) 
Sio008s uithdestosnt yeom edt to widen bis squoig to sateotvonu? 
(ff) .ctdersbeel to nonsmonsiqisnd et solvated ‘quorrg, Yo i | 
equous dtiw beviovn! peinennapetobevinsereettenee xn 


eleog ot dellqmooos of antvintde me 


socevedmos tedoeet Lo sénenes 


smoldorg ctiw beviovnl od IIb ‘ 
-snotvonu? qteewebeol boa s 


need artol aan 4 “sind 
alagien ‘Lo stad m0 2 


38 
persons who are leaders. (2, 10, 12) Such ideas are com- 
parable to those about "good" teachers in that they imply 
that there are certain necessary and unchanging qualities 
which make a person a leader or make him a teacher. However, 
both of these concepts are being modified through research. 
The concept of the "ideal" teacher is being replaced by 
notions that involve patterns of effectiveness related to 
various situations and criteria. Leadership has undergone 
a similar change in definition. 

The lack of clarity in the term "leadership", like 
the difficulty with the term "good teacher” is partly the 
result of a semantic confusion. By making the word mean 
too much, it has been impossible to use it with precision. 
Halpin distinguishes three meanings of leadership commonly 
implied in the use of the term. (12, p. 173) Leadership may 
be seen, first, as a formal position held by a person who 
occupies the "headship" in relation to a group. This is the 
appointive, political, or bureaucratic view of leadership. 
The leader of a gang of boys is a different kind of leader 
in relation to his group than is the teacher to the group 
of which he is leader and head. The former performs a social 
function in his group; the latter fulfils the expectations 
of an appointive position. A second meaning of leadership 
involves the "effectiveness" with which the leader's role 


is performed. Effectiveness, of course, must be conceived 
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39 
as being relevant to certain criteria held by the group 
itself. As different norms are held by various groups, the 
nature of the leadership which is "effective" for each group 
is also different. Finally, the word leadership is used to 
convey ideas about the capacity of the individual to perform 
the leader's role effectively. 

The word leadership, thus, tries to describe leader- 
ship, evaluate it, and, at the same time, explain it as an 
innate quality of the leader. Halpin looks for a way out of 
this problem by asking, as some researchers in teacher 
competence are doing, not what is a leader but what do leaders 
do and how do they behave? ‘Teachers are clearly leaders of 
their class groups in the formal sense of headship. It may 
also be surmised that teachers are leaders in the sense of 
being leaders of the class as a social group. In any case, 
studies of leader behavior have relevance to the evaluation 
of teacher effectiveness to the degree that they provide a 
theoretical research framework, concepts, and instruments 
through which research in teacher effectiveness may be 


carried on. 


The Ohio leadership studies. Extensive and rewarding 
studies of leadership have been carried on by the Personnel 
Research Board of the Ohio State University. (23) Their 
research has resulted in a remarkably simple instrument 


known as the Leader Behavior Description Questionnaire which 
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has been found to describe the behavior of persons in 
positions of formal leadership. The project began with the 
purpose of studying what a leader does and how he goes about 
doing it. That is, the work began with a minimum of assumptions 
about what leadership is and how it functions. The working 
definition of leadership that was adopted stated that leader- 
ship "is the behavior of an individual when he is directing 
the activities of a group toward a shared goal." (23, p. 7) 
The LBDQ was the result of the study of how the leader 
functions. 

The staff of the project began by classifying nine 
areas or dimensions of leadership. These dimensions were 
empirically derived using the staff members and ™... 
representatives of separate disciplines, each with its own 
orientation or problem approach (not to mention individual 
preferences or inelinations). ... ™ (23, p. 8) This 
procedure resulted in the derivation of 1790 questionnaire 
items related to these nine basic dimensions. These items 
were then reduced to 150 and the testing of the questionnaire 
began. A sample of 357 responses from a diversified group 
provided data for item analysis. As was suspected in the 
preparation of the items, the nine basic dimensions were 
revealed to be interrelated and not mutually independent. 
Such findings led to the factor analysis of the nine dimensions. 


Through factor analysis Halpin (23) identified two basic 
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a 
dimensions of leadership and constructed a thirty item 
questionnaire to measure these two factors. The two 
independent dimensions were called Initiating Structure and 
Consideration and are described by Halpin as follows: 

Initiating Structure-in-Interaction refers to leader 

behavior in delineating relations between himself and 

the members of his group, and in endeavouring to establish 
well-defined patterns of organization, channels of com- 
munication, and ways of getting a job done. ... 

Consideration refers to behavior that reflects 

friendship, mutual trust, respect and warmth in the 
relationship between the leader and group members. This 
represents the "human relations" aspect of leader 
behavior. (12).p.t175) 

Such apparently simple descriptions of leadership 
would seem to have little relevance for the teacher and the 
classroom. However, in completing the questionnaire about 
their leaders, group respondents were not asked to make any 
judgements about desirable or undesirable behavior. These 
dimensions, then, are justified in that they have been found 
in several instances to describe the behavior of leaders. 
Halpin sees in these dimensions of leadership several 
implications for teacher behavior. 

First, when the two dimensions of leadership were 
assessed by an external criterion of effectiveness in accom- 
plishing group goals, it was found that leadership behavior 
as described by the LBDQ and effectiveness were highly 
related. An important implication, therefore, is that 


effectiveness of teacher behavior will be “augmented only to 


such an extent as (the teacher) increases both his initiation 
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of Structure and his Consideration." (12, p. 175) 

Secondly, Halpin sees this view of leadership as a 
more reasonable and attractive alternative to the "demo- 
cratic" view of leadership which has held sway among school 
avieiceuteden fonisome eines (ihe primary duty of the 
leader in the educational setting is to lead in terms of the 
goals of the whole enterprise while maintaining the Considera- 
tion aspects of group membership. 

The essence of leading is to initiate Structure-in- 
Interaction, and to orient these structures continually 
towards the solution of group problems and the accomplish- 
ment of the goals prescribed for the group. Research 
indicates that this "Structuring" can be engaged in with 
no sacrifice of Consideration. (12, p. 176) 

Finally, leadership studies in teacher behavior may 
make it possible to train teachers to be better leaders and 
more effective teachers. This generalization is true of the 
whole behavioral approach to teacher competence. When a 
specific kind of teacher behavior can be shown to be related 
to a designated kind of effectiveness, it is possible to use 
the presence of such behavior in a classroom teacher as a 
criterion of effectiveness. Thus behavior itself can act 
as a lower level predictor of the ultimate criteria of 


effectiveness, which have been designated as involving the 


fostering of certain kinds of pupil growth. 
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IV. LEADERSHIP AND TEACHER BEHAVIOR 


Using the implications suggested by Halpin from the 
Ohio leadership studies, it is possible to investigate 
~teacher behavior from a new and promising approach. This 
approach has been adopted by McBeath (16) in his study of 
teacher leadership and effectiveness. The problem which 
McBeath studies is the relationship of teacher leader behavior 


to teacher effectiveness. 


A conceptual framework and method of investigation. 
McBeath takes the theoretical framework of his study from a 
model suggested by Mitzel. (17; 16, pp. 26-29) This frame- 
work is similar to those suggested by Ryans and Levin (21, 15) 
in that it separates levels or types of variables and indicates 
the importance of the behavioral variables. Mitzel gives 
four kinds or levels of variables. Type I Variables are 
characteristics of the teacher such as personality, back- 
ground and education. Type II Variables are factors from 
the environment of the school situation. The community, the 
school, and the pupils themselves with their attitudes, 
interests, and abilities are part of these variables. Type 
III Variables are those which deal with behaviors of pupils 
and teachers in interaction. The interaction of these 
variables is conditioned by the nature of Variables I and II 


but their interaction is dynamic and not static. Type IV 
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Variables are the standards by which the results of education 
are judged. These are the intermediate and not the ultimate 
goals set by the community. They are stated in terms of 
pupil growth, subject matter knowledge, appreciations, and 
social skills. 

The method of the study involved the random selection 
of fifty-nine teachers in twelve schools located in different 
Divisions of rural Alberta. Each of these teachers was 
rated for effectiveness and described as a leader by an equal 
number of other teachers, by their respective principals, by 
their superintendents, and ten of their own students. The 
instrument used in describing the teacher as a leader was an 
adaptation of the Ohio LBDQ. McBeath refers to this 
adaptation as the TLBDQ, The rating of effectiveness was 
gained from a six-point global rating scale. The data 
collected made it possible to relate the leadership style of 
the teacher to the effectiveness ratings provided by various 


judges for the same teachers. 


Conclusions from the McBeath study. McBeath draws 
several significant conclusions from his study, but three 
of these are of overriding importance. (16, pp. 110-13) 

1. The most important finding from the major problem of 
the study was that "teaching effectiveness appeared to be 
related to the extent of a teacher's Consideration and 


Initiating Structure behavior." The most effective teachers 
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were above the median on both dimensions of leadership and 
the least effective teachers were below the median on both 
dimensions. 

2. The amount of agreement between respondents describing 
behavior was no greater than the amount of agreement in 
describing the effectiveness of teachers. It appears, there- 
fore, that it was "just as difficult to agree on certain 
aspects of what the teacher does in the classroom as it was 
to agree on how well he teaches in general." 

3. The students however, showed consistency in rating 
the leader behavior of their teachers and "they were the 
only judges who showed some significant agreement on descrip- 
tions of behavior." They were, of course, the only judges 


in a position to judge teacher behavior at first hand. 


Implications in leadership and effectiveness studies. 
The studies of teacher leadership and effectiveness have 
clear implications for the administrator concerned with 
teacher effectiveness in current school situations but, more 
Significantly, there are also implications for further 
research. Even within the limitations of the McBeath study, 
it was evident that leadership behavior did not account 
entirely for the effectiveness of teachers. In accepting 
and working within Mitzel's framework, McBeath limited 
himself principally to Type III Variables since these are the 


focal point in the interaction of teacher and pupil behavior. 
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It is clear, however, from this and other theoretical 
models dealing with teacher behavior, that the most satis- 
factory research must deal with all levels of variables or 
at least with more than one at a time. A severe limitation 
in the McBeath study is that, while leadership was isolated 
as a measure of the behavior variable, it was not related 
to a criterion that was clearly independent of that behavior. 
In relying upon subjective judgements of effectiveness, 
McBeath limits the reliability, stability, and validity of 
the criterion. McBeath's criterion of effectiveness depends 
on a rating scale and thus the difficulties of mixing value 
judgements with objective assessment is immediately en- 
countered. The value of McBeath's research is to indicate 
the presence of a relationship between leadership and 
effectiveness regardless of how ambiguous that nature of 
that relationship may be. 

With leadership identified as a behavior related to 
teacher effectiveness, the next step in research is to relate 
leadership to the other variables suggested by Mitzel's 
model, Though past research could not establish unambiguous- 
ly the efficacy of subject matter knowledge as a factor in 
teacher effectiveness, no one has been willing to discard it 
as an important factor. The Mitzel model and the McBeath 
study, thus, clear the path for hypotheses designed to test 


the theory that subject matter knowledge can be given effect 


oa 
fsciserooid teito bos ehid mont , seven ligeaghes = a 


-aidse geom edd dst pxoidwaded onde idiadainiesligobat 
to eeldatiev to alevel Ils nviw Leeb taum doteeset yrotoet 
noitstinif ersves A .emit 2 ds suo asddi(enom ety tesef gs 
bsteloei esw asietionah eLtdw qtads at ubate aitsesGoM odd ot 
bedsiet jon asw ti ,eldsizsv yoivedsdleds-20 ommesem-ahes 
tolveded Jed? lo gnebmeqsbnt ylxselo @Bw ded’ nolusdite 6 od 
cconeviteetie to etnsavsbs; ev.itsetdwe coqe gatyfetnt - 
te Wwibilsy bes totlidsse ii Lideeiem odd ettmtl dasedoM 
ebseash asenevicosits to nofvodito elddBeGeM, saciterixo edg 
eulev xinktim Yo eoteies 1th old aud? Sngelese gator sino 
rie Yedatbenns of jnomeceees svRiashde rigiw aznemegbut, re 
eteotbat of et doresest 2'iteotolt to exbev-et? .bevetanes ; 
bas qidexebsel neowsed idee ae to sonsee7g ent 
to owden gant exouytdns a lal 997 pier diese 


y 


By dtew cit hed 


OO Se—”~6=~—S—hOO—OO—O 


od bedsler tofveded s es . 
: i n an iste 
stelet ot at dotapes: mi cide i. et — on 3 eset 
&'leetim yd bsteounue zee ak sie ur Tygal : 7 
-evountdmany detidsdes Jor biden et 00d 10s 
7 ve | at . tu a , 


at to¢9st 6 a8 embelworml + a 
7: braseth of iad need 7 
NYeeloM ede bre 
inet oF bertgiseb e ( 


Joo'tle neviy ed mao op 


ay 
4 
aa 


47 
only through a behavioral variable such as leadership. This 
supposition is borne out by conflicting studies which show 
scholarship in irregular or low correlations with effective- 
ness. (7, pp. 27-28) These results could be explained by 
the fact that a behavioral dimension such as leadership did 
not form part of the experimental design of these investi- 
gations. 

Finally, it should be pointed out that research into 
leadership behavior of teachers has implications for the 
training of teachers. If leadership is crucial to the 
effectiveness of teachers, knowledge is needed about how 
leadership is learned and how it is applied within class- 
rooms. If leadership can be trained, teacher education 
institutions must seek to develop it or at least to use it 
as a predictive factor in assessing the potential effective- 


ness of teacher training candidates. 


Implications in the present research. It is not 
possible within the scope of this thesis to undertake research 
that is entirely adequate in terms of the four levels of 
variables identified in Mitzel's model. It is hoped, however, 
that two of these levels may be related to each other in a 
way that is more rigorous than the method used by McBeath. 
From the behavioral level, leadership has been selected as a 
significant variable in teacher behavior and from the level 


of educational goals, pupil growth has been selected as an 
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L8 
important index of the degree to which such goals are being 
achieved in a classroom. It is immediately apparent, then, 
that the design of the research being reported here does not 
approach an exhaustive search of teacher effectiveness. 
However, the strengths of this research may be seen to lie 
in its orientation towards an over-all theory of teacher 
competence, in its reliance upon teacher behavior and pupil 
growth, and in its use of operational definitions of 
variables. 

It should be pointed out at this point that the 
definition of growth adopted in this study is not without 
its relevance to values and philosophies of education. The 
Alberta Grade IX examinations upon which the growth measure 
depends are primarily measures of specific material relevant 
to the course of studies prescribed in these two subject 
areas. Measurement of wider goals such as attitudes, 
appreciations, and critical thinking are not the primary aim 
of these examinations. In short, this measure of growth is 
one which reflects conservative values in education and one 
which is appropriate to the concerns felt by Bestor (5), 
Riesman (20), and the recent Royal Commission on Education 
conducted in the Western Canadian provinces. In applying 
this measure of growth, therefore, it is well to emphasize 
again the independence of "is" and "ought" in science. (11) 


That is, this research seeks only to discover what the 
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relation of the leader behavior of teachers is to such a 
measure of growth and does not imply that it is "good" or 
that such a relationship ought to exist. 

Several teachers, to whom the growth criterion used 
in this study was explained, have expressed concern over the 
narrowness and exclusiveness of the criterion. However, 
this criterion is justified in that it is an aspect of pupil 
growth that is commonly used to evaluate the progress of 
students in school. Pupil report cards, letter grades, 
marks, and the existence of provincial examination systems 
all reveal the extent to which such a criterion is an 
important value in many systems of education. This criterion 
remains only one aspect of pupil growth and the concern 
which these teachers felt was for the other aspects of pupil 
growth such as thought processes and attitudes that have 
proved so difficult to measure. (6) This research, then, 
does not attempt to make a definitive study of pupil growth, 
but merely seeks to select one aspect of the "is" of teacher 
effectiveness for study. From further research, involving 
different criteria of growth, it may be possible to improve 
administrative decisions about teacher effectiveness or to 
establish reliable knowledge when the "ought" decisions are 


considered. 
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CHAPTER III 


COLLECTION OF DATA 


The data for this study were collected during the 
months of May and June of 1960. Before proceeding with the 
collection of data, the proposed study was submitted to the 
general-secretary of the Alberta Teachers! Association who 
then wrote a letter asking for the’ cooperation of teachers 
and principals in carrying out the study. This letter is 
reproduced in Appendix B of this study. The proposal was 
similarly submitted to the superintendents of three urban 
school systems in the province. In each case permission to 
proceed was granted on the understanding that the principal 
and staff of the schools were to have the final decision as 
to whether the experiment was to be carried out in any one 
school. The letter of the superintendent of the systems 
from which most of the participating teachers was obtained 
is reproduced in Appendix C. The permission of Mr. A. G. 
McBeath, Superintendent of Schools, at Blaine Lake, Sask- 
atchewan, was obtained for the use of the TLBDQ. 

At its inception, the intention of this study was to 
secure a teacher sample of one hundred. However, a sample 
of this size proved impossible to collect and since this 
number of classes represented almost the total number of 


classes in one of these urban systems, it is possible that 
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the original sample size was too optimistically set. Since 
the collection of data was done in the last months of the 
school year during which many teachers and principals have 

a feeling of urgency in handling other matters, several 
schools decided against participation in the study. In 
view of these difficulties a teacher sample of fifty was 
decided upon and the hypotheses and purposes of the study 
were adapted to a sample of this size. It had also been the 
intention at the beginning of the study to select all the 
classes from one urban school system. Most of the classes 
were in fact taken from this system, but in order to complete 
the sample of fifty, seven classes were taken from a second 
urban system and two from a third. In the case of the three 
largest schools approached by the researcher, the decision 
was to reject the study due to the pressure of other work 


and activities in the schools. 
I. THE SCHOOLS 


The schools represented in the sample are urban 
schools in Alberta. All were selected on the basis of the 
Willingness of their principals and teachers to cooperate in 
the study. In the instances where only one teacher was used 
from the school, the principal was the teacher of the other 
subject field and was therefore not eligible for parti- 


cipation. The practice of departmentalization in many schools 
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meant that one teacher often taught more than one class of 
the particular subject and therefore the potential number of 
teachers eligible as teachers of Mathematics or Social 
Studies was smaller than the actual number of Grade IX 
classes in these subjects. In each case, the school and the 
teachers were identified by the use of a code. The key to 
this code was prepared on a preliminary visit to the school 
and was destroyed before the completed questionnaires were 
taken from the school. Thus the data in the hands of the 
researcher is identified only be means of the code number. 
Table I shows the schools identified by number and the number 
of teachers in each subject area who are part of the experi- 


mental group. 
II. THE TEACHERS 


The number of teachers of Social Studies participating 
in the study is greater than the number of teachers of 
Mathematics. This difference in the sample is due to the 
fact that the school principals were more often teachers of 
Mathematics than of Social Studies. Since principals did not 
participate as teachers, fewer Mathematics teachers are 
represented in the sample than Social Studies teachers. 

The teachers in the school who were not part of the 
experimental group have been denoted as "other teachers” and 


this group was used as one of the judges of the leadership 
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TABLE I 


PARTICIPATING TEACHERS AND CLASSES BY 
SCHOOL AND SUBJECT 
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56 
behavior of the participating teachers. For each teacher in 
the experimental group, five other teachers were called upon 
to complete a TLBDQ. Where a school was divided into 
elementary-junior high or into junior-senior high, only the 
teachers of the junior high grades were asked to take part 
as "other teachers". In some cases the other teachers were 
asked to complete TLBDQ's on more than one teacher. for 
instance, where there were four teachers in the experimental 
group in a single school, twenty other teachers would have 
been required to fill out TLBDQ'ts without replication. In 
some cases, teachers in the experimental group were asked to 
act as other teachers in regard to remaining teachers in the 
experimental group. In all cases, as broad a representation 
of other teachers was sought as was possible in the circum- 
stances. 

In most cases the other teachers completed the 
TLBDQ's at a meeting held in the school. Where this was not 
possible, the TLBDQ was explained by the researcher to the 
teachers and the questionnaires completed later by the 
teachers. In some cases, teachers expressed the view that 
their observations of their colleagues were inadequate to 
permit accurate completion of the questionnaire. In such 
cases, the teachers were asked to complete the questionnaire 
to the best of their knowledge. In a few cases, teachers 


expressed unwillingness to complete the questionnaire on the 
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grounds that to do so would constitute unethical conduct. 
In these cases, other teachers were asked to complete a 
questionnaire in their place. In most instances, teachers 
were interested, cooperative, and very generous of their 


time in furthering the project. 


III. THE STUDENTS 


The students who were part of the judging groups were 
selected randomly from one of the classes taught by the 
participating teacher. Where a teacher taught more than one 
class, the class was selected with which the teacher had 
the greatest amount of contact. In many cases, this would 
be the home-room class of the teacher. In some of the 
schools, the classes were homogeneously grouped. Since a 
regression technique was used to obtain the growth index, it 
is felt that at least part of the impact of such grouping 
was controlled in the experimental design. 

The administration of the TLBDQ to the students 
required a total of ten to fifteen minutes for both explana- 
tion and completion. This work was completed within the 
classroom with all students hearing the directions but only 
the ten designated students actually completing the TLBDQ, 
Interest in the questionnaire was high on the part of the 
students and no teachers reported difficulties as a result 


of its administration. In all cases the teachers were asked 
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58 
to remain in the room during the explanation of the TLBDQ to 
the students. In this way, it was hoped by the researcher 
that greater teacher understanding of and confidence in the 
TLBDQ would result. Some teachers remained in the room while 
the students completed the questionnaire and other elected to 
leave. The completed student TLBDQ's were collected by the 


researcher. 
IV. THE PRINCIPALS 


In every school the principal was asked to complete 
one TLBDQ for each participating teacher in his school. As 
judges, the principals were in a position different from 
that of the teacher or student judges. In the first place, 
the principal is frequently in a position where he can make 
regular observations of the teachers on his staff. Many of 
the principals in the urban systems forming part of this 
study are designated as supervising principals and their 
ratings of teachers may be regarded as a professional rating. 
In any case, none of the principals expressed difficulty in 
completing the behavior descriptions on the questionnaire or 
expressed unwillingness to make the descriptions. 

However, perhaps the most important aspect in which 
the principal judges were different from the teacher and 
student judges lies in the fact that the rating by the 


teachers and students represent an average rating. In the 
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case of students, the rating is an average of ten descrip- 
tions and in the case of teachers, the rating is an average 

of five descriptions. For the principal, however, there is 
only one behavior description for each teacher, This 
difference represents an important limitation upon the 
leadership ratings of the principal since there are varia- 
tions within group ratings of a single leader. The desired 
number of raters in a group is approximately seven for maximum 
reliability. The difficulty with a single rating of leader 


behavior, then, is in the danger of reduced reliability. 


V. PUPIL GROWTH DATA 


Pupil growth data were gathered in two parts. The 
first section consisted merely of listing by name the pupils 
Within the participating teachers' classes and coding the 
lists with the same code number assigned to the teachers. 

In almost all cases, the clerical work of listing the pupils 
was done by the schools involved while the coding was 
completed by the researcher before the list was taken from 
the school. 

The second section of gathering data for the pupil 
growth index was completed during the latter part of August. 
All of the data in this section were gathered directly from 
the Department of Education records. Although the examina- 


tions are written in June, the marking and tabulation of the 
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results is not completed until late August. Essentially 
two items of data were required for each pupil from the 
complete data compiled by the Department--the Mathematics 
or Social Studies mark of the pupil and his SCAT mark. All 
the data of the Department are recorded on IBM cards as well 
as in individual pupil files. It was decided to make use of 
the IBM cards and for this purpose the name of each pupil 
on the class lists was identified with his Departmental 
record number. This identification was done by the researcher. 

A master set of IBM cards was then prepared with each 
card containing the name of the pupil, his Departmental 
number, and the code number of the participating teacher with 
which he was connected. With these cards, the relevant 
data cards of the Department were identified and these were 
reproduced with the pupil's name, number, and class code 
also punched on them. From these cards, total distributions 
and class distributions of Social Studies, Mathematics, and 


SCAT marks were prepared. 
VI. REPORTING OF RESULTS 


The reporting of the findings of the investigation 
will be presented in the next three chapters of the thesis. 
These chapters correspond to the major areas investigated in 
the study. Chapter IV reports the leader behavior of the 


fifty-one teachers in the experimental group as their 
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behavior was described by three groups of judges. Chapter V 


reports the growth indices associated with the classes taught 
by the fifty-one teachers and Chapter VI reports the relation- 


ship between the measures of teacher leadership and the growth 


indices. 
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CHAPTER IV 
ANALYSIS OF TEACHER LEADER BEHAVIOR 


This chapter reports the analysis of the Teacher 
Leader Behavior Description Questionnaire (TLBDQ). For each 
of the fifty-one teachers in the experimental group, sixteen 
TLBDQ's were completed by different persons. Principals 
completed 51, other teachers completed 255, and students 
completed 510, making a total of 816 completed question- 
naires. Hach of these questionnaires was scored and the 
derived C-score and S-score assigned to a teacher was an 
average of the group's description of teacher behavior. 
That is, the ratings of five teachers were used to derive 
the final C-score and S-score assigned as the rating of other 
teachers, and the ratings of ten students provided the final 
C-score and S-score assigned as the rating of students. In 
the case of principals, only one description was available 


for each participating teacher. 
I. ANALYSIS OF THE TLBDQ USING DESCRIPTIVE STATISTICS 


For each teacher in the experimental group, three 
independent ratings of behavior were obtained and each of 
these teachers was assigned a C-score and S-score on the 
basis of these ratings. In examining the results of these 


descriptions, it is apparent that each judging group has 
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characteristics which tend to distinguish it from the other 
groups. The distributions of the ratings assigned by each 
group are presented in Tables II and III, with Table II 
showing the ratings of Consideration and Table III showing 
the ratings of Initiating Structure. It is apparent that 
teachers and students present a decided negative skew in the 
distribution of their ratings. That is, the teachers and 
students tended to make ratings at the high end of the scale. 
At the same time, the distributions of ratings based on 
principals' judgements, are more nearly symmetrical than 
those of teachers and students. Moreover, principals! 
ratings are made over a wider range indicating greater 
differences in their perceptions of teacher behavior than 
either the other teachers or students are willing to accord 
the same behavior. In the case of students' ratings of 
teacher S-scores, there seems to be a marked reluctance to 
recognize differences in behavior since almost half of the 
cases appear in one interval. 

Tables IV and V, pages 65 and 66, show a descriptive 
analysis of the leader behavior data for the three groups of 
judges. Table IV presents the data for the Consideration 
dimension of leadership and Table V the data for the 
Initiating Structure dimension. In the case of the means 
and quartile points, the figures have been rounded to the 


nearest whole number, Again the results show trends that 
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TABLE II 


C-SCORES OF FIFTY-ONE TEACHERS AS RATED BY PRINCIPALS, 
OTHER TEACHERS, AND STUDENTS 


PRINCIPALS OTHER TEACHERS STUDENTS 
Interval Frequency Frequency Frequency 
57-60 it 
53-56 6 
L9=52 5 8 3 
L5=-h8 8 1h 14 
h1-hh ibs) 16 14 
37-40 6 h 8 
33-36 9 6 8 
29-32 2 2 2 
25-28 @) ab Ze 
21-24 ale 


TABLE III 


S-SCORES OF FIFTY-ONE TEACHERS AS RATED BY PRINCIPALS, 
OTHER TEACHERS, AND STUDENTS 


PRINCIPALS OTHER TEACHERS STUDENTS 
Interval Frequency Frequency Frequency 
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53-56 iP iL 1 
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65 
mark each judging group as distinct from the others. The 
principals' ratings on both dimensions reflect their greater 
range and show higher variability than do the other teachers 
and students. The means of students' ratings are lower than 
the other two groups and the principals generally have the 
highest means. Over-all the other teachers and students seem 
to be more similar to each other in ratings than either is to 


principals. 


TABLE IV 


MEDIANS, QUARTILE POINTS, QUARTILE DEVIATIONS, MEANS, 
AND STANDARD DEVIATIONS FOR THE C-SCORES OF FIFTY- 
ONE TEACHERS AS DESCRIBED BY PRINCIPALS, 

OTHER TEACHERS, AND STUDENTS 


PRINCIPALS OTHER STUDENTS 
TEACHERS 
Q 3 49 46 46 
Q2 43 43 13 
Ql SY) By Sif 
Q 6.0 3.5 4.5 


M 143 LZ Ll 
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TABLE V 


MEDIANS, QUARTILE POINTS, QUARTILE DEVIATIONS, MEANS, 
AND STANDARD DEVIATIONS FOR THE S-SCORES OF FIFTY- 
ONE TEACHERS AS DESCRIBED BY PRINCIPALS, 

OTHER TEACHERS, AND STUDENTS 


PRINCIPALS OTHER STUDENTS 
TEACHERS 
Q 3 50 50 45 
Q 2 16 17 13 
Ql Al Ah 40 
Q Le5 3.0 205 
M 1.6 16 h2 
Sle UGH 4.7 Lok 


In analyzing these distributions, no attempt was made 
to go beyond the usual descriptive statistics because of the 
nature of the scale on which the measurements were made and 
because of the limited sample and evident skews in the 
distributions. The purpose of this study does not include 
the investigation of the possible significance of such 
differences as do appear. However, a measure of the con- 


sensus among the various judges was sought and to this 
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purpose, correlations among the judging groups were calcu- 
lated. Two kinds of correlations were calculated to determine 
the similarities in the ratings of the judges. First a non- 
parametric measure of correlation was sought using the 


Spearman rank correlation coefficient: rg. (3, pp. 202-213) 
IIT. NON-PARAMETRIC ANALYSIS OF TLBDQ DIMENSIONS 


The correlations among the judges’ ratings on the 
dimensions of Consideration are presented in Table VI and 
on the dimension of Initiating Structure in Table VII. The 
statistic used is the Spearman r,. (3, pp. 202-13) In each 
case, significant correlations are revealed with the strong- 
est agreement being shown between principals and teachers on 
the Consideration dimensions and between students and 
teachers on the Initiating Structure dimension. In each 
ease, the distance between principals and students is 
greater than the distance between the other two groups. 

The over-all consensus among the three sets of judges 
was determined using the Kendall coefficient of concordance: 
W. (3, pp. 229-38) The correlations for the two dimensions 
of leadership are reported in Table VIII and both are 
revealed to be significant. 

The correlations between the dimensions of leadership 
as described by the three groups of judges are reported in 


Table IX. Since the two dimensions of leadership were 
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TABLE VI 


SPEARMAN r, CORRELATIONS BETWEEN JUDGES* DESCRIPTIONS 
OF CONSIDERATION 


PRINCIPALS STUDENTS 
OTHER TEACHERS 6 * 55** 
STUDENTS 36%* 


**kSpearman r. significant at .Ol 
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TABLE VII 


SPEARMAN r, CORRELATIONS BETWEEN JUDGES! DESCRIPTIONS 
OF INITIATING STRUCTURE 


PRINCIPALS STUDENTS 
OTHER TEACHERS . 50** 67% 
STUDENTS 3 35** 


**Spearman r, Significant at .Ol 
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TABLE VIII 


CONSENSUS AMONG JUDGES ON RATINGS OF 
LEADERSHIP DIMENSIONS 


JUDGES C-SCORE S-SCORE 


PRINCIPALS, OTHER TEACHERS, 
AND STUDENTS . 69** OY tes 


**Kendall's coefficient of concordance, W, significant 
at .O1 level 


TABLE IX 


SPEARMAN r. CORRELATIONS BETWEEN CONSIDERATION AND 
INITIATING STRUCTURE AS RATED BY PRINCIPALS, 
OTHER TEACHERS, AND STUDENTS 


CORRELATION 
PRINCIPALS oh O** 
OTHER TEACHERS .36** 
STUDENTS . 38** 


**Significant at .Ol 
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derived by orthogonal factor analysis, a high degree of 
independence can be expected in the rating of leader 
behavior. (4, pp. 24-27) However, Halpin reports the cor- 
relation between Consideration and Initiating Structure to 
be .38 as revealed in the sample of 249 aircraft commanders. 
(2, pp. 1-2) As calculated using the Kendall W, all the 
judges show significant correlation in their descriptions 
of the dimensions of leadership, with the principals’ 


ratings reaching the highest correlation. 
III. PARAMETRIC ANALYSIS OF TLBDQ DIMENSIONS 


It is recognized that the nature of the measurement 
implied by the TLBD9Q scale and the skew of the resulting 
distributions do not fully meet the assumptions for a para- 
metric analysis. (3, pp. 18-34) However, to take maximum 
advantage of data, parametric analyses of the relationships 
described in the previous section were also carried out. 

_In most cases it may be seen that the results approximate 
those derived from the strictly non-parametric analysis. 
Since the parametric statistics permit further analysis, they 
have been used in the later analyses, but the attempt has 
been made to corroborate findings so obtained with other non- 
parametric analyses. 

Correlations using Pearson r's are presented in 


Tables X and XI for the dimensions of Consideration and 
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TABLE X 


PEARSON r CORRELATIONS BETWEKEN JUDGES' DESCRIPTIONS 
OF CONSIDERATION 


PRINCIPALS STUDENTS 
OTHER TEACHERS .O5** 67 
STUDENTS ie kates 


*K*Significant at .O1 


TABLE XI 


PEARSON r CORRELATIONS BETWEEN JUDGES' DESCRIPTIONS 
OF INITIATING STRUCTURE 


PRINCIPALS STUDENTS 
OTHER TEACHERS Oa Tak 
STUDENTS hex 
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72 
and Initiating Structure. As is to be expected the product- 
moment correlations are higher than the corresponding non- 
parametric correlations. (1, p. 2&7) However, the Pearson 
r's do not vary uniformly from the corresponding Spearman 
correlations, though they do maintain a similar pattern. 
Again the judgements of teacher with principals and teachers 
with students are closer than the judgements of principals 
and students. No overall consensus can be calculated using 
parametric statistics and so no statistic comparable to the 
Kendall coefficient of concordance is presented. 

The parametric analysis of the independence of the 
two dimensions of leadership as rated by the three groups of 
judges is presented in Table XII. As in the non-parametric 
analysis, the two dimensions are revealed to be significantly 
correlated, The spread between the three correlations is 
smaller in this instance, but again the principals reveal 


the highest correlation between the dimensions. 
IV. DISCUSSION 


The analysis of the TLBDQ reveals three areas of 
interest. The first is a problem of procedure related to 
the choice of either a parametric or non-parametric analysis. 
Secondly, the distributions of the dimensions of leadership 
raise certain questions about the nature of leadership. 


Finally, the distributions of the dimensions raise questions 
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TABLE XII 


PEARSON r CORRELATIONS BETWEEN CONSIDERATION AND 
INITIATING STRUCTURE AS RATED BY 
PRINCIPALS, OTHER TEACHERS 
AND STUDENTS 


CORRELATION r 


cs 
PRINCIPAL 50% 
OTHER TEACHERS Qa 
STUDENTS 25 


**Significant at .Ol 


about the groups of judges making the descriptions. 

It is recognized that the parametric analysis of the 
TLBDQ is not strictly justified in terms of the level of 
measurement implied by the instrument. However, the original 
Ohio studies violated these same assumptions in dealing in 
means, standard deviations, Pearson r's, and factor analysis. 
(4) The use of these statistics is justified to some extent 
by the empirical findings of the original Ohio studies, the 
MeBeath study, and the analysis presented here. By using a 
non-parametric analysis to arrive at the degree of consensus 
revealed among the groups of judges, it is possible to 
compare such an analysis with the results gained through a 


parametric analysis. In both cases, the results were shown 
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7h, 
to be statistically significant. Moreover, the patterns of 
the results were similar in both cases. Since the use of 
the parametric analysis permits the application of the most 
powerful tests, it is felt that the use of parametric 
analysis is justified where similar results can be shown 
through a non-parametric analysis. 

The distributions of the ratings of leadership are 
interesting first because they do reveal a strong consensus 
among the judging groups. However, there remain distinct 
qualities about the distributions of each group. The distri- 
butions of other teachers and students reveal distinct skews 
and the problem is raised of whether these skews are a 
product of the scale or whether they represent the true 
distribution of ieadsesrs in the population of teachers 
from which this sample was drawn. Similar skews are shown 
in the results reported by Halpin in educational settings. 
(2, p. 10) It is impossible to conclude from the evidence 
presented what the cause of such skews is, but it is 
interesting to note that in the principals' judgements, the 
distributions are nearly normal. 

Other characteristics of the distributions are also 
worthy of note. The principals’ ratings are the highest 
ratings, but are more variable than teachers' or students’ 
ratings. The distance between each rating is approximately 


what might be expected with the principal, who is physically 
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most removed from the area of teacher behavior, differing 
most from students who are physically closest to teacher 
behavior. The other teachers! ratings lie between the two 
other descriptions of teacher leader behavior. 

The distributions of the ratings of leadership also 
raise questions about the groups making the ratings. The 
principal is, in most cases, a person practised in the task 
of making judgements about teachers. The normality of the 
distribution seems to reflect the fact that the principals 
in these schools did not hesitate to separate teachers in 
their ratings. This spreading of the descriptions is seen 
in the greater range and variability of the principals! 
ratings. On the face of the situation, it would seem that 
the principals are making real distinctions about the 
teachers observed, 

The ratings of the other teachers and students are 
marked by their comparative shortness of range and small 
variability. Apparently, teachers and students as judges 
of leadership are unwilling to indicate great differences 
among the behavior of teachers whether they perceive them 
or not. It is, however, impossible to state the signifi- 
cance of the variability until it is compared to the growth 
criterion. All that can be said at this time is that 
apparently teachers and students see differences in teacher 
behavior in smaller units of the TLBDQ scale than do 


principals. 
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CHAPTER V 
PUPIL GROWTH INDICES 
I. PUPIL SCAT AND ACHIEVEMENT DATA 


The basic data from which the growth indices were 
calculated are the academic aptitude of pupils as measured 
by SCAT and pupil achievement in either Social Studies or 
Mathematics. For the ae pupils in the classes of 51 
teachers, four distributions of marks were derived. Two of 
these distributions represent pupil achievement in Social 
Studies and Mathematics and the other two represent the SCAT 
marks of the same students. To differentiate the two distri- 
butions of SCAT marks, one will be referred to as Social 
Studies SCAT and the other as Mathematics SCAT. These four 
distributions are presented in Tables XIII and XIV. Table 
XIII shows the achievement of 636 pupils in Grade Ix 
Mathematics. The results are the raw scores of the examina- 
tion before the usual Departmental scaling of marks has 
taken place. Also presented in the table is the distribution 
of raw score SCAT marks obtained by the same pupils. 

Table XIV shows similar distributions for 826 Grade IX pupils 
in Social Studies and SCAT. 

The distribution of Mathematics results is notable 
for its decided positive skew. Though the range of marks, 


in this case, is from 4 to 162, the mean is 61.1 and the 
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TABLE XIII 


RAW SCORE DISTRIBUTIONS OF MATHEMATICS AND 
SCAT ACHIEVEMENT 


MATHEMATICS SCAT 
Interval Frequency Frequency 
160-169 1 
150-159 5) 
140-149 4 
130-139 1k 
120-129 19 
110-119 29 1 
100-109 32 3u 
90-99 16 79 
80-89 5h LN, 
70=79 47 135 
60-69 5 119 
50=59 Bld 87 
LO-h9 70 52 
30=39 66 Tak 
20=29 Sh h 
10-19 57 
0-9 8 
N 636 636 
M Oiled: Wee 
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TABLE XIV 


RAW SCORE DISTRIBUTIONS OF SOCIAL STUDIES AND 
SCAT ACHIEVEMENT 


SOCIAL STUDIES SCAT 
Interval Frequency Frequency 
160-169 5 
150-159 20 
140-149 50 
130-139 87 
120-129 8h 
110-119 108 
100-109 109 38 
90-99 110 126 
80-89 &6 176 
70-79 5h 169 
60-69 De iol 
50-59 Syl 95 
LO=-19 ae 59 
30=39 5 11 
20-29 1 1 
N 826 826 
M 101 7h 2 
Qo 104, f exdl 


See Cie 16.6 


SO 
standard deviation 33.5. The distributions of Mathematics 
SCAT and Social Studies SCAT show slight negative skews. 

The raw score Social Studies distribution is at least fairly 
symmetrical. In view of these distributions, it was felt 
that a normalization of the raw scores would more closely 
represent academic aptitude and achievement in the sample 
than the actual raw score distributions. Therefore, the 
distributions were normalized by converting the raw data into 
T scores. (1, pp. 314-18) The resulting distributions as 
normalized are shown in Tables XV and XVI. It may be seen 
from these tables that the actual means and standard devia- 
tions of these distributions approach the theoretical values 
of 50 and 10 very closely. 

Using the normalized distributions, the over-all 
correlations between aptitude and achievement were calculated 
for the two subject areas. In the case of SCAT and Mathe- 
matics the correlation was .78 and for SCAT and Social Studies 


the correlation was .77. 


Class ability and achievement. From the normalized 
pupil data, the sum of X and Y was calculated for each class. 
That is, the total of the achievement in SCAT and Mathematics 
or Social Studies was found for each class. These totals 
were then converted to an average for the class by dividing 
the two totals by the number of pupils in the class. Thus 


the original data involving 1,462 cases were reduced to 51 
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TABLE XV 


NORMALIZED DISTRIBUTIONS OF MATHEMATICS AND 


Interval 


80-8), 
(2) 
70-74, 
65-69 
60-64 
DI O9 
50=5k 
hO-h9 
LO-hh 
I-59 
30-34 
25-29 
20-24 
15-19 
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SCAT ACHIEVEMENT 


MATHEMATICS 


Frequency 


SCAT 


Frequency 


160.1 
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TABLE XVI 


NORMALIZED DISTRIBUTIONS OF SOCIAL STUDIES AND 


Interval 


80-84 
1-19 
70-74 
65-69 
60-64 
Dah) 
D0= 5h, 
L5-L9 
LO=Lb 
Deby) 
30-34 
25-29 
20-24 
15=19 


SCAT ACHIEVEMENT 


SOCIAL STUDIES 


Frequency 


SCAT 


Frequency 


50.0 


10.0 
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83 
cases representing the participating classes and teachers. 
Distributions of the class X and Y were then prepared for 
the SCAT and achievement data. From these data it was discovered 
that each of the four distributions was significantly 
different from normal despite the fact that the individual 
cases had been normalized. Since a regression technique was 
to be used in deriving an index of growth, it was considered 
essential to further normalize the class KX and Y. This was 
done using the T scale once more. The correlations between 
the class average aptitude and the class average achievement 
were then calculated. These correlations were .93 for SCAT 


and Mathematics and .85 for SCAT and Social Studies. 


Predicted class achievement. From the data on actual 
class ability and achievement, a regression equation was 
calculated so that a measure of predicted class achievement 
could be derived. In fact, two regression equations were 
used--one in the Mathematics classes and one in the Social 
Studies classes. The data from which these equations were 
derived are presented in Table XVII. The equation derived 
for Mathematics was: Y'=.93X + 3.63. The equation derived 
for Social Studies was: Y'=.87X 4 6.46. In each case, the 
X is to be understood to refer to the average of class 
academic ability and the ¥' as predicted average achievement 
for the class. The standard errors of the estimated Y's are 


3.55 and 5.14 for Mathematics and Social Studies respectively. 
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TABLE XVII 


DATA USED IN THE CALCULATION OF REGRESSION EQUATIONS 
OF PREDICTED CLASS ACHIEVEMENT 


MATHEMATICS SOCIAL STUDIES 
M, 19.9 50.0 
My 50.1 49.9 
Sx 9.6 9.6 
Sy 9.6 9.7 
Ixy 93 285 


II. CALCULATING PUPIL GROWTH INDICES 


The predicted class achievement was then compared 
with the actual class achievement. The difference between 
the two, Y - Y', was then divided by the standard error of 
the estimated Y. (1, pp. 160-62) The result of these opera- 
tions is the growth index. However, in order to eliminate 
negative signs and to make the indices comparable in size to 
the indices of leadership, each raw growth index was multi- 
plied by 10 and the product thereof increased by 30. The 
distribution of these indices is shown in Table XVIII. 

The indices of growth indicate wide differences 
between teachers in respect to the amount of learning accom- 


plished in the classes of which they are the leaders. It is 


0.08 vi @.24 5:2 | pees G76 148 vt, a*ave | 


ate it ‘ee ‘ver na “@ 38 wf ; 
eo Ee ae 
in | : Prt ‘s @een as lev anh@ ; 
sn f ri et. aT oe 5 wag 


te Uae 


eaorant NIWORO JT8 ion Bags: J 
intel neds asw snemeve ties seats begstberq ont” endl i 


ceowded soneyeTitbh oT ,dnomevettos seals Lewtos Sid Hebe 
to tovrs brsbiede ort yd bebtvib nedd sew ,'Y - lowed ede 
~st9qe onedd 10 Ieee! (S3-08L .dq (1) VY! bedemtved! wid 
atentntle o¢ tebto nt ,twvewoH sxebnt daworry sity’ ef anots 
od otle nt sldetaqmes seo tbr pehrartiabere sip 
~islum eow xebnit déwor wet dose’ ,qbi 
od? ,O& Yd bedeeront- toored? vouborg’ ods nigeoran 
-LIIVY olde? mk nwotle et a aa te 
seonoreTi ts ebiw : 
2moo0 6 yitter ser Yo wnweiie 
et 3] .etebasl ete 


ome 
Hed a a i 


we ny im 


’ 
4 


+ ie 


85 
TABLE XVIII 


GROWTH INDICES FOR FIFTY-ONE TEACHERS AND THEIR 
CLASSES IN MATHEMATICS OR SOCIAL STUDIES 


MATHEMATICS SOCIAL STUDIES TOTAL 
Interval 
0 1/ at i 
52-54 it a 
49-51 @) ) 
46-48 @) 8) 
13-15 5 6) 5) 
LO=-42 @) 2 2 
37-39 2 2 h 
34-36 6 3 S 
31-33 1 Z 3 
28-30 2 7 9 
25-27 2 6 g 
22=-2h @) AL al 
19-21 2 a 3 
16-18 2 8) Z 
13-15 2 2 L 
10-12 ib ik 
N 22 29 Sl 


difficult to determine the significance of these differences. 
The growth indices may in part be the result of interaction 
between the teacher and the class, however, to some extent, 
the index of growth may be thought of as a quality of the 


teacher. To illustrate the possible impact of various growth 


[ f “Seek ; 
; ; =e 
a" | 7 
0 iia © Sansa 
y “9 . 5 | SyleE sl 7 
i nwt 
S § 6 Sa 
ry § $ ' eee | 
© a ) . 
e ) ¢ Db dei aoa ea > S 
7 y s OE é 
i a § TS-2S 
; fo 
: 0 S Biot a 
al S /& (R168 o 
i f 4 AnVag SL-OL » 7 . 


fe es of Pe ee 
, ; oo haiipes Hag 


's mwas 


-AONSTSTILb seeds % eonsoltingle edd eah 
nolsosrstial lo tigeet eds ed dr. 1 
.J09IKS emoa oF |, tevVewod ,& Lo 

odd to Viileup 6 ee 
ddworg euolrey Yo gonqme 


86 
indices with classes of various abilities, Table XIX has been 
prepared. The projected class achievements were calculated 
by working in the reverse direction through the formulae for 
growth and regression used in the procedures of the study. 
Thus instead of deriving an index of growth, this is taken 
as given, and, using given values for XK and the predicted Y, 
the actual pupil achievement is calculated as a dependent 


variable arising from class ability and its growth index. 


TABLE XIX 


PROJECTED CLASS ACHIEVEMENTS DERIVED FROM THE INTERACTION 
OF GROWTH INDICES AND CLASS ABILITY 


MATHEMATICS SOCIAL STUDIES 
Growth Growth 


Class 65 Class 65 
Ability Ability 


50 


In Table XIX, three growth indices were selected in 
each subject area. These three are, in each case, the 
highest, the lowest, and an average growth index actually 
recorded in that subject area by teachers. The ability 


indices are arbitrarily chosen points on a T scale represent- 
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87 
ing extremes of ability. The projected achievement scores 
are assumed to approximate the T scale. It may be seen from 
the table that low growth indices depress class achievement 
and high indices increase it. This procedure is not a 
completely satisfactory one but it does serve to underline 
the differences in class achievement associated with 
productive and non-productive settings with classes of 


varying ability. 


III. DISCUSSION 


Two main issues are raised in appraising the results 
of the measures of growth. First there is the issue raised 
by the normalization of the pupil and class data. By 
normalization, apparent differences in the raw data are 
minimized in some instances and in others are increased. 
However it is felt that this procedure does less violence 
to the true distribution of the traits in the sample than 
do the observed raw score distributions which are sometimes 
inexplicably skewed. For example, the raw score Mathematics 
distribution reveals a startling positive skew, while the 
SCAT distribution for the same sample reveals a near normal 
range of ability. It is felt that normality is a defensible 
assumption to make about the abilities measured in the samples 
and therefore, that the use of the normalized distributions 


is justified. 
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The second issue is raised by the averaging of ability 
and achievement within the class to produce more readily 
manageable data. Unfortunately, the result of such procedure 
is really the loss of data. The mean does not convey the 
range of data in the class nor the variability of it. In 
speaking of a similar problem, Mitzel and Gross comment that 
"this violation of statistical independence among the 
variates masks real differences that may exist among the 
means of classes." (2, p. 20) The fact that the correlation 
among class X and Y's is higher than the correlation among 
individual pupil X and Y's seems to corroborate the effect 
of masking which results from dealing in means of classes. 
That is, the means do not adequately represent the distri- 
butions from which they are drawn and obscure real differences 
among the classes. Despite this weakness, it was decided to 
proceed using the results derived from class means, though 
such a procedure necessitates some caution in the final 


interpretation of results. 
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CHAPTER VI 


LEADERSHIP AND GROWTH 


The crucial section of this study is the chapter 
reporting the relationship of the measures of teacher leader 
behavior to pupil growth. This relation was investigated 
in two sections with two procedures involved in each section. 
The first section takes the dimensions of leadership 
separately and seeks to establish the relation of each to 
growth. The second section investigates the effect of the 
joint functioning of the two dimensions upon growth. In 
each of the two sections a parametric and a non-parametric 
analysis are used. The parametric techniques used are 
Pearson's r and multiple correlation. The non-parametric 


techniques used are chi-square and Fisher's exact probability. 


I. GROWTH AND THE DIMENSIONS OF LEADERSHIP 


This section of the chapter is concerned with the 
relation of growth to the dimensions of leadership taken one 
at a time. The parametric analysis is given first and is 
followed by the same relationship analyzed by a non- 


parametric procedure. 


The parametric analysis. Six Pearson r correlations 
represent the relation of growth to the two leadership 


dimensions for each of the three judging groups. Pearson's r 
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91 
was used in the correlations as the most powerful analytical 
device available in this regard. The results are shown in 
Table XX. Each judge's rating of Consideration and Initiat- 
ing Structure is significantly correlated with growth. It 
should be noted that, since the original hypothesis predicted 
the direction of the correlation, the levels of significance 


are assigned from one-tailed tests. 


TABLE XX 


CORRELATIONS OF GROWTH INDICES WITH THE LEADERSHIP 
DIMENSIONS OF CONSIDERATION AND 
INITIATING STRUCTURE 


CORRELATION lee CORRELATION los 
PRINCIPALS . 26% . 28% 
OTHER TEACHERS eat o ble 
STUDENTS se aor: 


*Significant at the .05 level 
**Significant at the .01 level 


None of the correlations presented in Table XX 
represent significant differences from any other correlation 
in the table. However, it is interesting to note that the 


Consideration dimension of all three judges correlates 


£e 
fsotsylens Iyhaeien vaon 49 a2 enetzaterien et mt bean soe 


ingle || sedeed bel adeno: aon tongteea! ems 


: 
7 
me “rds awh PATH) ra rls ae . a 


id yeperer nf Rigen tira oe 


soe Cane | 


are cr 


Xx oidalt ik Seguecesd inne 
notselor109 teddo Yns mor? 
eit tart etoa 


92 
uniformly with the growth measure and that all three are 
Significant at the .05 level. For students and other 
teachers, the correlation of Initiating Structure with 
growth reaches the .0l1 level of significance. Apparently, 
then, the Initiating Structure dimension is more strongly 
related to growth than is Consideration. 

The use of parametric statistics in relating growth 
to leadership raises the same questions as have been raised 
previously in the analysis of TLBDQ data. Figures 2 and 3 
are the scattergrams relating growth to students! judgement 
of Initiating Structure and Consideration. These figures 
are presented to give a graphic representation of the 
correlations represented in Table XX. Only those for 
students are presented, since they are representative of the 
scattergrams derived for the other two groups. Figure 2 
shows the relation of growth to Initiating Structure. The 
scattering is diffuse, but enough linearity is apparent to 
be consistent with the reported Pearson r of .43. Figure 3 
shows the relation of growth to Consideration. The scatter 
is about as diffuse as in the case of Initiating Structure, 
but the presence of several highly atypical scores may account 
for the lower derived correlation of .25. 

The distributions which went to make up the cor- 
relations of growth with the dimensions of leadership are 


definitely skewed ones. This has already been discussed in 
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CORRELATION OF GROWTH AND INITIATING STRUCTURE AS JUDGED BY STUDENTS 
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95 
relation to the leadership dimensions. It is apparent, too, 
from Table XVIII, page 85, that the growth indices represent 
a non-normal distribution. In view of this situation, two 
different procedures are open to the investigator. First, 
the distributions may be normalized. This was considered 
to be an unjustified step since it is difficult to determine 
what the nature of these two traits is in the population. 
Normalization in this instance may produce more distortion 
than it eliminates, Secondly, it is possible that the 
correlation ratio is the most appropriate statistic to use 
in analyzing the scattergrams of growth and leadership 
dimensions. However, the number of cases involved is only 
half of that considered adequate for the correlation ratio 
with the result that any correlation would likely represent 
a serious inflation of the real correlation. (2, p. 296-7) 
Moreover, the correlation ratio does not permit further 
analysis through multiple correlation. In view of these 
difficulties, neither of these procedures was followed and 
the Pearson r's were accepted and used in further multiple 


correlation of leadership and growth. 


The non-parametric analysis. After the parametric 
analysis of the relation of growth to leadership was cal- 
culated, a non-parametric analysis of the relation was sought 
as well. The procedure involved dichotomizing both growth 


and the dimensions of leadership. The two leadership 
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96 
dimensions were divided into high and low groups. The 
growth indices were divided into high, low, and average 
groups with only the high and low groups being used in the 
analysis. Four-fold tables were produced for each of the 
three judging groups and the resulting distributions 
analyzed by chi-square goodness of fit tests. The results 


of this procedure are given in Table XXI. 


TABLE XXTI 


re VALUES FOR THE RELATIONSHIP OF GROWTH TO THE 
DIMENSIONS OF LEADERSHIP 


CONSIDERATION STRUCTURE 

AND GROWTH * AND GROWTH * 
PRINCIPALS ole 1.88 
OTHER TEACHERS a aby h29** 
STUDENTS 52 9.5 * ex 


*af=1, N=34 
**Significant at .05 
‘er Sienitieant at .Ol 


The results of the chi-square analysis are somewhat 
different from the Pearson r correlations. Consideration 
disappears as a significant dimension entirely and Structure 


is significant only in the judgement of other teachers and 
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students. In evaluating these results, it should be 
remembered that the chi-square analysis is considerably less 
powerful than the Pearson r where the assumptions of Pearson 
can be met. In any case, Structure emerges from both the 
parametric and non-parametric analyses as a stronger factor 


in relation to growth than is Consideration. 
II. LEADERSHIP STYLE AND GROWTH 


In the immediately preceding section, the dimensions 
of leadership have been taken separately and related to 
growth. Implied in the concept of leadership style is the 
idea that the dimensions are mutually supportive and that in 
conjunction they are more effective than in separation. 
Leadership style is derived from quadrants of a graph 
representing the distribution of teachers according to 
their C-scores and S-scores. One graph is derived for each 
of the three judging groups. The distribution of teachers 
on these graphs is shown in Table XXII. As in the previous 
section, the analysis of the data in this section proceeds 


on both a parametric and a non-parametric basis. 


The parametric analysis. In applying multiple 


correlation, the two dimensions of leadership, Consideration 
and Structure, are used as independent variables and growth 
as the dependent variable. The intercorrelations used in 


determining R are those derived from the correlations of the 
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two dimensions of leadership to growth and from the correla- 
tions between the dimensions themselves. These intercorrela- 
tions are reported in Table XXIII. The actual multiple 
correlations, R, are reported in Table XXIV. Since the 
multiple correlation coefficient is subject to inflation, 


the corrected coefficient, .R, is reported in the same 


Cc 
table. (2, p. 399) Significant correlations are evident for 
students and other teachers. (1, p. 301) Leadership behavior 
as judged by principals does not produce a significant 


multiple correlation. 


TABLE XXII 


DISTRIBUTION OF FIFTY-ONE TEACHERS WITHIN THE QUADRANTS OF 
LEADERSHIP STYLE AS DESCRIBED BY PRINCIPALS, 
OTHER TEACHERS, AND STUDENTS 


QUADRANTS OF OTHER 
LEADERSHIP PRINCIPALS TEACHERS STUDENTS 
I +C uy) dep) 12 
+5 
LE +C eZ i155) 13 
-S 
Jed -C 13 eS 13 
-S 
IV -C 13 12 13 
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TABLE XXIII 


INTERCORRELATIONS USED TO DETERMINE MULTIPLE CORRELATIONS 
OF GROWTH WITH CONSIDERATION AND STRUCTURE * 


PRINCIPALS OTHER TEACHERS STUDENTS 
Bee 26 2k -25 
Tos .28 Ak 4.3 
Yes 50 «42 “45 
* g Growth ec Consideration s Structure 
TABLE XXIV 


MULTIPLE CORRELATIONS OF GROWTH WITH CONSIDERATION 
AND STRUCTURE * 


Rees ot 
PRINCIPALS sy 25 
OTHER TEACHERS 6 5b he 
STUDENTS bee LO** 


**Significant at .05 


* g Growth e Consideration s Structure 
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Table XXV shows the coefficients of multiple deter- 

mination. According to Guilford, such a coefficient "tells 
us the proportion of variance in X, that is dependent upon, 
or associated with, or predicted by X5 and X4 combined with 
the regression weights used." (2, p. 397) In this case, 
we may interpret these coefficients as the contribution 
which Consideration, Structure, and both dimensions together 


contribute to growth as a dependent variable. 


TABLE XXV 


COEFFICIENTS OF MULTIPLE DETERMINATION USED IN PREDICTING 
GROWTH FROM CONSIDERATION AND STRUCTURE 


PRINCIPALS OTHER TEACHERS STUDENTS 


CONSIDERATION 4.16% 1.61% 1.77% 
STRUCTURE 5.6% 18.2% 17% 
COMBINED 9.76% 19.8% 18.9% 
Re es ail 6 oye 4 


*Multiple R's significant at .05 


For the two significant judging groups, the dimensions 
of leadership in combination contribute somewhat less than 


one-fifth of the total growth variance. Of the total 
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variance, Consideration contributes very small amounts. 
Structure contributes much larger amounts. In the judge- 
ments of principals, the amount of variance contributed by 
the dimensions is approximately equal, but the combined 


amount contributed is not significant in the final R obtained. 


Non-parametric analysis. Corroboration of the 
results found through the multiple correlations was sought 
by applying Fisher's exact probability to graphs of leader- 
ship and effectiveness. These graphs represent distri- 
butions made according to three factors--Consideration, 
Structure, and Growth. The basic graph is made up of the 
dimensions of leadership. Within each of the quadrants of 
leadership style, teachers have been categorized as high, 
average, or low in growth. The number of teachers in each 
guadrant is fixed, but the kind of effectiveness which they 
display is free to range in the three groups of high, low, 
or average growth. 

In using Fisher's exact probability, the teachers of 
average effectiveness were disregarded so that growth could 
be dichotomized as high or low. To create a second dichotomy, 
the quadrants were compared two at a time. Thus the necessary 
four-fold tables are produced in which the Fisher statistic 
may be calculated. Figures 4, 5 and 6 represent the graphs 
of leadership and effectiveness according to the three groups 


of judges. Tables XXVI, XXVII, and XXVIII represent the 
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results of the tests using the Fisher exact probability. 


FIGURE 4 


EFFECTIVENESS OF FIFTY-ONE TEACHERS WITHIN A GRAPH BASED 
ON PRINCIPALS' RATINGS OF THE DIMENSIONS OF LEADERSHIP 


Note: H denotes a teacher high in effectiveness. 
KX denotes a teacher of average effectiveness. 
L denotes a teacher of low effectiveness. 
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FIGURE 5 


EFFECTIVENESS OF FIFTY-ONE TEACHERS WITHIN A GRAPH 
BASED ON OTHER TEACHERS! RATINGS OF THE 
DIMENSIONS OF LEADERSHIP 


Note: H denotes a teacher high in effectiveness. 
K denotes a teacher of average effectiveness. 
L denotes a teacher of low effectiveness. 
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FIGURE 6 


EFFECTIVENESS OF FIFTY-ONE TEACHERS WITHIN A GRAPH 
BASED ON STUDENTS! RATINGS OF THE 
DIMENSIONS OF LEADERSHIP 


Note: H denotes a teacher high in effectiveness. 
X denotes a teacher of average effectiveness. 
L denotes a teacher of low effectiveness. 
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TABLE XXVI 


LEADERSHIP AND GROWTH ANALYZED BY FISHER'S EXACT 
PROBABILITY USING PRINCIPALS’ RATINGS 
OF THE DIMENSIONS OF LEADERSHIP * 


L H L H L H 

i 3 8 I 3 8 iu 3 8 

A h L, tel 6 2 TV 3 
L H L H L H 
18) 3 TY? & 3 JIE 6 2 
AE 6 2 IRGIG e/ L, sEAEAL L L 


*None of the distributions are significant. 


TABLE XXVII 


LEADERSHIP AND GROWTH ANALYZED BY FISHER'S EXACT 
PROBABILITY USING OTHER TEACHERS! RATINGS 
OF THE DIMENSIONS OF LEADERSHIP * 


L H L H L H 
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TABLE XXVIII 
LEADERSHIP AND GROWTH ANALYZED BY FISHER'S EXACT 


PROBABILITY USING STUDENTS! RATINGS 
OF THE DIMENSIONS OF LEADERSHIP 


L H L H 

I i My 6* if iL 6 
Jie JE 7 3 BN PR tga 6 
L H L H L H 

EYAGS 2 6 yer el? 6* 1k 7 3 
Tet if 3 JEU 7 2 Teele 7, 2 


**Significant at the .025 level 
*Significant at the .05 level 


The three graphs show patterns similar to those which 
have already been revealed in the analysis up to this point. 
That is, the other teachers, and students show considerably 
more strength in discerning teacher effectiveness through 
leadership than do principals. The principals' judgements 
about leadership are almost a randomization as far as 
effectiveness is concerned. In the case of other teachers 
and students, the patterns associate high effectiveness in 
Quadrant I and low effectiveness with Quadrant III. That is, 


the degree of growth attained by a class is associated with 
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the style of leadership exhibited by the teacher. 

These patterns are seen more plainly in the tables of 
the analysis by Fisher's exact probability. Once more there 
is a progression in patterns of effective judgement from 
principal through to other teachers and students. The 
principals! judgements represent a randomization that is 
some distance from significance levels. In the case of 
other teachers, none of the distributions reach the level of 
Significance but patterns of superiority are apparent. In 
students! judgements, Quadrant I is superior to Quadrants 
II and III and Quadrant IV is superior to Quadrant III-- 
all at significant levels. 

In analysis by Fisher's exact probability, it is 
differences between diagonals of the four-fold tables which 
indicate significance. It is clear, then, that there is 
little to choose between Quadrants I and IV in effective- 
ness. Both show patterns of high effectiveness. Conversely, 
Quadrants II and III reveal patterns of rather low effective- 
ness. Thus the greater strength of Structure over Considera- 
tion in inducing growth is again apparent, though, in this 
case, both are jointly associated with pupil growth in 


Quadrant I. 
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III. DISCUSSION 


The first problem raised in this chapter of the study 
is the strength of association between Consideration and 
pupil growth. Four pieces of evidence have been adduced in 
relating Consideration and growth and in all of them the 
relationship has been tenuous at best. The strongest indi- 
cation of a significant relation comes from the correlation 
by Pearson's r of Consideration and growth. However, even 
here the correlations run at about .25 and therefore account 
for only 6 per cent of the total variance. 

In the non-parametric analysis of Consideration and 
growth, this dimension of leadership disappears entirely 
as a Significant variable. The chi-square analysis employed 
here is less powerful than the Pearson r but one might expect 
from the parametric analysis that Consideration would appear 
with more strength than is implied in the chi-square values 
of almost zero. 

In the analyses of leadership style, Consideration 
again bears a minor role in relation to growth. The multiple 
correlation assigns only two per cent of total growth 
variance to Consideration. In the Fisher exact probability, 
patterns of significance are revealed when Consideration and 
Structure are combined in Quadrant I but it is impossible 
to differentiate the part played by the two dimensions. 


On the other hand, Structure emerges clearly as a 
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predictor of growth. In both the parametric and non-para- 
metric analyses, this dimension is continually associated 
with growth. Since Structure is correlated with Considera- 
tion, it is possible that the strength which is shown by 
Consideration is merely a result of its overlap with the 
Structure dimension, 

A second area of interest raised in this chapter is 
the degree of success attained by the various judges in 
predicting growth. The principals are obviously the weakest 
group. They emerge as a significant group only in the 
Pearson correlations between growth and the separate leader- 
ship dimensions. In the other three analyses, including 
the powerful multiple correlation, the principals are 
conspicuously unable to predict pupil growth through their 
judgement of teacher leader behavior. This result is the 
more surprising in view of the fact that the principals’ 
judgements of leader behavior had a high face validity.* That 
is, their judgements had a greater range and normality than 
the judgements of other teachers and students. 

Although the principals represented here were willing 
to differentiate the behavior of teachers to a greater 
extent than teachers or students, such judgements were less 
significant in terms of pupil growth than were the judgements 


{ 


*See Section IV of Chapter IV, pp. 72-75. 
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of teachers and students. However, one would be wary about 
generalizing to the population of principals from the 
present sample. The number of principals represented in the 
sample was only nineteen and they were using an instrument 
intended to be used in conjunction with the judgements of a 
group of like persons. Thus the superior success of students 
and teachers may only be due to the fact that such judgements 
were the product of group ratings not single judgements. 
Nevertheless, in the common practice of rating teachers, 
such rating is almost invariably the product of a single 
judgement. It has been argued that since this single 
judgement is based upon frequent observation, mature judge- 
ment, and long experience that the results acquire validity. 
This validity is not reflected in the case of a small 
sample of principals rating the leader behavior of their 
teachers. 

Finally this chapter raises the question of whether 
the statistically significant relationship between leader- 
ship and growth represents practical significance in school 
and classroom situations. The multiple correlations 
indicate that the two dimensions of leadership are associated 
with approximately twenty per cent of the variance in growth. 
In assessing the importance of this contribution to growth, 
it should be borne in mind that such potentially powerful 


factors as previous pupil learning, total school environment, 
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socio-economic background, motivation, teacher experience, 
and teacher training were not controlled. These factors 
were not controlled partly because of the difficulty of 
making controls on such pervasive factors and partly because 
it was the purpose of this study to focus upon teacher 
behavior. That a single aspect of teacher behavior should 
emerge as a measurable and significant factor is an indi- 
cation of its importance in the school setting. 

Moreover part of the difficulty in relating leadership 
to growth may stem from the fact that the index of growth, 
though it is independent and objective, is not entirely 
satisfactory as an indicator of pupil change. The SCAT and 
achievement measures were taken concurrently at the end of 
the school year. The SCAT, as a predictor of pupil success, 
may itself be subject to the same teaching measured in the 
achievement score. Therefore a purer measure of pupil 
capacity might work more efficiently than SCAT as the xX 
variable in the growth index. Another change in the growth 
index that might strengthen its performance would be to 
employ two achievement measures as X and Y variables. Thus 
the X taken early in the year and the Y taken late in the 
year might operate as a better indicator of rate of change 


in growth than does the present index.* 


*See pp. 10-11 for a discussion of growth indices. 


rae aan 
,2onelteqxe isdosesd ,no idevbtom bavortgtosd ‘otmonose=0ts08 - \ 
erosost seed? ,belflorines ton ‘ovew Si ledend redo seg bas u 
to yiyotitin edt Yo eaysosd yigwsq Bellowinep Son -e1ew 
eausood thoreg bas axotont ‘eviesvseq sows ino \aLoxsmon geblem 
reddened noqur exaot oft vbuse. sini. 46 iepoetpevedibiame aki 
| Slvode stoiveried tedoset to toeqes elyate © tsi? .tolveded 
~ibei ns at rodesh dasottingie bus efdeisesem s es Syteme 
 asettdea Loodoe odd nt sonsdtoqmt edt Ye motte. 
giterebacd anktelex ni ytiuoltith edd %o sis¢ teyeetoM os) 2 
(‘Sdworg to xebst odd tedt tos? odd movdomedte yam deworg ot : 
_ Vienidte dom eh ,evitostde bas sasbasqebat ab ti dyvorts 
bas TAQR adh segneio Leauq ‘Yo todsokbnt ae 98 yrovostehtss Ce 
to bre edt ae yidnoriwonoo need onew eerwesem InemevelioR 
,22e0oue Liga to todo tbeng & ‘say PTADe en?) msey Lootow sift | 
edt ot bervesem antiosed emse edt o¢ Sootdue ed Least yam | 
Liquq Yo stwesom rem 8 stotetsdT .etcce Jnemeve tio 
X edt en TADE recs yLinetottte exom arow drigte yttoaqeo | / 
ddwors end of egmeds roddoma .xebmt igworg odd) ot efde trey 
_ ot 9d bLivow! spaenrtotreq 20d: nedegae ute: Ing te wae: rebee 
aut? ,eeldsizey) Yobos x on newur som gnemevedstos: ows: toLaqe ee 
odd at etal nedad Yond bre tssviedd at ¥ | | odd 
earedo to oJe4 te rota that redded mas. 


_ 


_ 


: 
~-S ee eins see ns 
.ceotbat déworg ermeer 


oy 
idl 


¢ 


aly? 
REFERENCES 


Ferguson, George A. Statistical Analysis in Psychology 
and Education. New York: McGraw-Hill Book Company, 
imei LOO 


Guilford, H. P. Fundamental Statistics in Psychology 
and Education. New York: McGraw-Hill Book Company, 
Inc., 1956. 


yes eg 


, i. 
_ ih earest Om ise 4 u ) 


+9 not: bedeubt 


» 
b, 
wv 
a 
ra 
~ 
—. 
4 


i 


CHAPTER VII 
CONCLUSIONS, IMPLICATIONS, AND RECOMMENDATIONS 


The aim of this study has been to investigate teacher 
behavior and pupil growth within a theory of leadership and 
within a systematic framework of teacher effectiveness. The 
theory of leadership is that derived from the Ohio studies 
of leadership and the framework of effectiveness is that 
developed by Mitzel. This framework of effectiveness 
identifies teacher behavior as a focal point linking pupil 
growth to other teacher and environmental variables, The 
behavior element selected for study in this thesis is the 
leader behavior of teachers and the purpose is to relate 
such behavior to pupil growth. 

The Ohio theory of leader behavior involves two 
dimensions of leadership. One of these, Consideration, is 
defined as behavior oriented to the maintenance of personal 
security, friendliness, and satisfaction in group members. 
The second, Initiating Structure, is defined as behavior 
establishing goals, institutional demands, and orderly 
procedures. In investigating teacher behavior, the dimensions 
of leadership have been related to effectiveness through a 
criterion based on pupil growth in two school subject areas, 
The growth achieved in these areas is that measured by a 


set of external examinations when the academic aptitude of 
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pupils is held constant. Thus the criterion of effectiveness 
is determined independently of ratings of teacher behavior 
supplied by three groups of judges--principals, other 
teachers, and students. 

The study involved an experimental group of fifty- 
one teachers whose behavior as leaders was described by the 
three groups of judges using the Teacher Leader Behavior 
Description Questionnaire. The teachers were all part of 
urban school systems and were teachers at the Grade IX level 


of either Social Studies or Mathematics. 
I. CONCLUSIONS 


The conclusions following are those derived from the 
testing of the null hypotheses of the study. 

(1) There is significant agreement among principals, 
other teachers, and students in their descriptions of teacher 
leader behavior. At the same time, it was noted that each 
of the descriptions had characteristics peculiar to itself. 
Therefore, though there is statistically significant agree- 
ment among the judges, it is apparent that there are some 
differences in principals', teachers', and students! per- 
ceptions of the leadership of classroom teachers. This is 
further borne out by relation of these judgements to growth. 

(2) Initiating Structure as a dimension of leadership 


is related to pupil growth. This dimension of leadership 
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appeared significant in each of the tests relating leadership 
to growth. It survived both the parametric and non-para- 
metric analyses designed to test this relationship. 

(3) Consideration as a dimension of leadership is 
Significantly related to growth but the extent of this 
relationship is smaller than that shown by Initiating 
Structure. The parametric analyses of Consideration revealed 
its relation to growth to be significant though limited in 
scope. That is, Consideration is associated with only a 
very small proportion of total growth variation even 
though that proportion is statistically significant. In the 
non-parametric analyses, Consideration performed less 
strongly than in the parametric analysis and displayed only 
an ambiguous relation to growth through its association with 
Initiating Structure. . 

(4) Other teachers and students performed more success- 
fully than principals as judges of leadership in predicting 
pupil growth through teacher leadership. Students performed 
most consistently in predicting growth through their descrip- 
tions of teacher leader behavior. Other teachers showed 
Similar strength but principals were unable to predict 
growth except when the dimensions of leadership were 
considered separately. Even when the dimensions were taken 
separately, the strength of the principals judgement was 


weaker than might be expected from a group so favourably placed 
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as principals are. The weakness of the principals' judge- 
ments may be due to two factors: the sample of principals 
was small and their ratings of leadership were not modified 
by other ratings in the same groups as was the case with 
other teachers and students. 

(5) Leadership which is high in Structure or high in 
both Structure and Consideration is related to pupil growth. 
Initiating Structure is thus the common element in both 
styles of leadership but until further analysis of growth 
is conducted it is premature to state that Consideration is 
unrelated to growth. 

The conclusions of the study, related to the major 
research hypotheses, may be generalized into two statements: 

(1) Principals, other teachers, and students are able 
to reach significant agreement about the leader behavior of 
classroom teachers. 

(2) Teacher leader behavior is significantly related to 


pupil growth. 
II. IMPLICATIONS 


Leadership and growth. The study showed that approxi- 
mately one-fifth of the total growth variance was associated 
With qualities of leadership within the classroom teacher. 

The question arises of how important this is in administrative 


arrangements providing for teaching and learning situations. 
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The answer to this question is suggested if we recall how 
leadership was defined. It is the way a teacher goes about 
the business of teaching, the daily procedures, the very 
general techniques and methods of dealing with groups and 
the individuals in them. In short these are areas about 
which very little is known in terms of the effects which 
they have upon pupil learning. Much has been assumed and 
much good advice exchanged about behavior in these areas, 
but little has been achieved that can be described as clear 
and positive knowledge about the consequences of these 
behaviors. If it can be shown that these behaviors are 
related to pupil growth in some fashion, then this is 
important knowledge which may take a significant place in 
administrative decisions. 

It is obvious from this study that much more knowledge 
is needed about leadership and growth. It is possible that 
teacher variables such as personality, training, scholarship, 
and teaching experience are also related to pupil growth 
through leader behavior. It is possible, too, that there 
are differences in subject matter areas, in grade levels, 
and in school environment that effect the relationship of 
leadership and growth. It is impossible to regard the 
present knowledge about leadership and growth as in any way 
complete and, in using such knowledge as is presently avail- 


able, the administrator is well advised to use it as an 
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indicator of possible arrangements rather than as a reliable 


tool clearly measuring the results of teacher behavior. 


The nature of leadership. The partial success of 
this and other studies in educational leadership raises the 
question of what precisely the meaning of leadership is. 

The origin of the measures of leadership indicates that we 
have answers to this question only at the level of opera- 
tional definitions. Such definitions are extremely useful 
in research situations, but if the results of such research 
are to have wider application and use in educational settings, 
the attempt will have to be made to establish leadership 

in other terms. Leader behavior in the classroom is 
presently limited to interpretation through the TLBDQ and 

no attempt has been made as yet to interpret leadership in 
terms of a particular subject matter or in terms of a 
program of teacher training. It is likely premature at this 
time to make such an attempt but such will ultimately have 
to be the aim if leadership is to have useful application in 
educational administration. 

If leadership can be reinterpreted from its opera- 
tional definition in the TLBDQ, it is possible that leader- 
ship can be considered as part of the professional training 
of teachers. We do not know what leadership means in 
psychological terms and we do not know the origins of 


leadership in an individual's behavior, but it is apparent 
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that there are patterns of teacher behavior related to the 
professional problem of transmitting knowledge and of 
inducing pupil growth. Knowledge and mastery of such patterns 
of behavior can thus form important tools for prediction, 
control, and improvement of outcome in the administration 


of education. 


The nature of growth. It is well to emphasize at 
this point that the definition of growth which was accepted 
for investigation in this thesis is by no means a final or 
ultimate one. It was selected as one which is of clear 
importance in education and as one which is open to relative- 
ly easy and accurate measurement. It is, however, narrower 
than the total range of growth which is often considered 
part of the aims of education. That is, the growth measured 
in this study has emphasized mastery of skills necessary for 
successful performance in two subjects included in a set of 
province-wide examinations. It may well be that there are 
other wider kinds of growth that can not be measured by 
such examinations. These kinds of growth might be identi- 
fied as creativity, critical thinking, citizenship, health, 
and other vague though important values. The impact of 
leadership upon these kinds of growth may be different from 
that investigated in this thesis. Such investigations must 
wait upon adequate measures of these other kinds of growth 


and even upon their adequate definition. In the meantime, 
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it is well to remember that the behavior investigated here 
has been a particular kind, the growth has been specific, 
and therefore that the effectiveness referred to is limited 


to this framework. 


The TLBDQ. As an instrument for the investigation of 
teacher behavior, the TLBDQ has performed as a highly 
effective tool. It is not, however, an instrument without 
flaw or limitation. As an infallible guide to the rating 
of teachers it is obviously inadequate. It was designed and 
must remain an instrument for research procedures. It 
depends to a critical extent upon the unreserved coopera- 
tion of those using it. Without the complete confidence of 
respondents, the reliability and validity of the instrument 
is destroyed. The anonymity of the teachers involved is 
thus an essential feature of the standardization of testing 
procedures for the TLBDQ. Moreover, in reviewing the results 
of the analyses, it is apparent that there are several 
highly atypical cases where growth and leadership are not 
at all consistent. It is impossible by using the TLBDQ 
alone to detect such discrepancies and it is further unlike- 
ly that global ratings of the teacher would be more success- 
ful than the TLBDQ in resolving such conflict. As an instru- 
ment, then, the TLBDQ requires cautious interpretation of 
its results and further investigation into its reliability 


and validity. 


OSL , | ideas 
e1sd bedezttzewnt roivaded-ert Jedt redmomen os Liew ab SE” 
vitioeqe need esd dowomg ede ,babl relvoteueg © need eed 
bedtimil at od beristet SHonevisoetic ond Jans onetoreds ‘bms, 
Oey: ck eS _ elrowomenYezay 08 


>!) PAP RAVE See ier fete f ‘ 
t i rl rf 
6 dokscaiseers ont a martial ne eA ‘Saar st 


| Udgid 5 28 bemmetxeg sad’ OGIT end vsotveded sedoe0y 
Sao ttw docu Sate bre revewod «tom ef aT) loot evidostts < 
gnit¢es: edd of ‘eb tim ealdilistat op eA Mottstimsl 10 welt 
bre benytesh epwidT». «odsupebent yLewo lds ‘at St evetoses Yo 
seetibesong do1s8eet 10% dnomrrgenk ne wbamoy’ gexit 7 
~£18qooo ‘bevireeeiny ed nogw dmregxe fsoisiio & ov abneqeb 
to sonebiiace etsiquoo ond. tuondtiW Vth gutew eeons to note 
Joommdant eft Io v9 ibis bas YeREtdetTew ed¥ \ednebuogae 
et bevlovnt e@tedosst ons to ytlayoons ed? .beyorvaeh at 5 il 
goideed Yo sotdaethrsbnste et to CxuiRet Letedeees we suid 
etiveor aft gniwetve: at visvoetoN .DEGUT odd uot BeiembeDoNy _ 
leveve2 stg siedd sede Jastedge eh df ,eoeylens edd Yo ‘ 
ton exe qiderebael bos déworg cede esee deo tyyon: vit 
PaHIT sit gatew yo oldteeoqmt ef 41 
-oxtiny tedciwt ef If bos cetoheqetdelb’ 
~s2sooue otom sd blvow totoset eft 2o- 
-widgert m6 eA .tokl neo dowe satvlowor ¢ 
lo aoliaverqrecnt | eg: roo 


yititdstier eat 
al ¢, <8 ugeyea4 


Ayal 
III. RECOMMENDATIONS 


1. Further studies of the kind undertaken here are 
recommended, That is, studies in effectiveness are recom- 
mended which work through the leader behavior of teachers 
and which relate such behavior to a measure of pupil growth. 
At the same time, wider or different measures of growth may 
be used in an attempt to broaden the application of the 
theory within the framework of effectiveness. 

2. The use of more powerful and sophisticated techniques 
in the calculation of growth indices is also recommended. 
Regression based on individual pupil results rather than 
upon class averages is an obvious means of strengthening the 
indices. Similarly the use of analysis of co-variance is 
appropriate since it is a method which makes full use of the 
data available in individual pupil scores. 

3. Item analysis of the TLBDQ is also called for before 
more extensive use of the instrument is undertaken. The 
availability of the growth indices make possible an item 
analysis in terms of the criterion as well as in terms of 
the results of the instrument itself. 

Lh. An investigation of the stability of the growth 
indices is also recommended. At present it is not known if 
the index is associated with the teacher, the class, or the 
interaction of the two. Such an investigation would be 


possible if growth indices were calculated using a single 
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class with various teachers. Follow-up studies from year to 
year with the same teachers or with the same class would 
give valuable information about such stability as well. 

5. The language of the TLBDQ is presently too difficult 
for lower grades and for classes with reading difficulties. 
Further studies might attempt to construct alternate forms 


or levels for the TLBDQ. 
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APPENDIX 


APPENDIX A 
TEACHER LEADER BEHAVIOR DESCRIPTION QUESTIONNAIRE 


The TLBDQ consists of thirty items distributed equally 
between the two dimensions of Consideration and Initiating 
Structure. The items from each dimension are presented 
randomly on the questionnaire. Each item is marked on a 
five point scale to indicate the frequency with which the 
behavior mentioned in the item occurs, The observer is thus 
describing the behavior of a leader by indicating the 
frequency with which certain designated behaviors occur, 

The leadership displayed is therefore the degree to which 
certain behaviors are present as judged by a group of 
observers. To retain the quality of each dimension the 
items are presented together. The numbering is that of the 
items on the TLBDQ. 

Frequency of occurrence of a behavior is shown on the 


following scale: A--Always 


B--Often 
C--Occasionally 
D--Seldom 
B--Never 


The numerical values assigned on the scale run from 0 to 4 
with the highest values being assigned to the highest fre- 
quencies of occurrence. The total possible score on each 
dimension is therefore 60 and the lowest 0. The fifteen 


items on the Consideration dimension are as follows: 
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He helps students with their personal problems. 

He does little things to make it pleasant to be a 
member of the class. 

He is easy to understand. 

He finds time to listen to students. 

He talks with students outside of class. 

He looks out for the personal welfare of individual 
class members. 

He gives students reasons or explanations for his 
actions. 

He acts without consulting the class. 

He praises students. 

He treats students fairly. 

He is willing to make changes. 

He is friendly and approachable. 

He makes students feel at ease when talking with him, 
He puts suggestions made by the class into operation. 
He gets students to help in the planning of class 
activities. 


Note: Item #16 is scored negatively. 


The fifteen items on the Initiating Structure 


dimensions are as follows: 


28. 
30. 


He makes his attitudes clear to the class. 

He tries out his new ideas with the class. 

He rules with an iron hand. 

He criticizes poor work. 

He speaks in a manner not to be questioned. 

He assigns class members to particular tasks. 

He schedules the work to be done. 

He maintains definite standards of performance. 

He emphasizes the meeting of deadlines. 

He encourages the use of uniform procedures. 

He makes sure that his job as teacher is understood by 
all class members. 

He asks that students follow standard rules and 
regulations. 

He lets class members know what. is expected of them. 
He sees to it that class members are working up to 
capacity. 

He sees to it that the work of the class is organized. 
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APPENDIX B 


ALBERTA TEACHERS' ASSOCIATION CLEARANCE 


April 22, 1960 


To Whom It May Concern: 


I have studied the thesis proposal "Teacher Leader 
Behavior and Its Relation to Effectiveness as Measured by 
Pupil Growth" planned by Mr. Greenfield. The rating of 
teachers proposed there is to relate certain behaviors to 
pupil achievement. The purpose is a research purpose, not 
related to merit rating for pay. 


Your cooperation in assisting Mr. Greenfield in the 
conduct of his study is requested. You will be assisting 
in the furthering of knowledge about education. 

Very sincerely yours, 


ALBERTA TEACHERS! ASSOCIATION 


Executive Secretary 


- diane 
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APPENDIX C 


PERMISSION OF SCHOOL SUPERINTENDENTS 


April 28, 1960 


Mr. t. B. Greenfield, 
Faculty of Education, 
University of Alberta, 
Edmonton, Alberta. 


Dear Mr. Greenfield: 


I have your interesting letter of April 20 briefly 
describing your research project on which you are embarked. 
I shall be happy to recommend it to the junior high school 
principals for careful consideration but, of course, they 
must be left free to decide for themselves whether or not 
they wish to participate. I think one of your main problems 
will be that of presenting this proposal to the schools in 
such a way as to win their co-operation and support. Certain 
aspects of the investigation may be deemed by teachers to 
be of a threatening nature. These will require very careful 
explanation, 


Yours very truly, 


Assistant Superintendent. 
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